nature 


THE INTERNATIONAL WEEKLY JOURNAL OF SCIENCE 


the bone 


Small-angle X-ray scattering 
and computer tomography 
combine to visualize hard 
tiSSue PAGES 308,349 & 353 


SPROEIME GEOMETRY ‘ENEALRELATITY a, = 
19 Nowe mber 2015 £10 
ABRUSH EINSTEIN’S LOGICIAN VoLS27.N 7B 
WITH GRAVITY HELPERS AT PLAY 
entanglement Rivals and colaborawrs who Lewis Carvoll’s adventures a 
versus relattvity made the theory possible in Wonderlandand beyond | 
PACE 290 PARE 298 PAGE 302 


THIS WEEK 


SPORT Science is no antidote 
to doping if the culture 
condones it p.276 


EDITORIALS 


WORLD VIEW Time to make CONSERVATION Saving 
universities more species from 
democratic p.277 extinction p.278 


cee 


Research for all 


Numbers on racial bias in research grants awarded by the US National Institutes of Health show 


that science has more to learn about inclusiveness. 


disparity in grants funded by the US National Institutes of 

Health (NIH), and show that for nearly 30 years, applicants 
from minorities have been less successful than white and mixed-race 
applicants in receiving funding (see page 286). The data, obtained by 
researchers through a Freedom of Information Act (FOIA) request to 
the agency, extend previous findings that showed racial disparities in 
NIH grants between 2000 and 2006. Those results had already led the 
agency to dedicate hundreds of millions of dollars’ worth of grants and 
programmes to try to rectify funding disparities. 

These disparities are not only unjust; they harm scientific and 
medical progress by shutting off funding to deserving scientists, and 
ultimately harm society and patients who would otherwise benefit 
from these scientists’ ideas. 

The big task now is to determine why racial funding disparities 
arise, and how to erase them. Researchers who have studied such 
disparities in science say that bias probably plays out in subtle and 
unsubtle ways in the grant-review process itself. 

In 2011, a team led by Raynard Kington, president of Grinnell 
College in Iowa, published a landmark paper reporting that black 
grant applicants were about two-thirds as likely as whites to be 
funded during 2000-06, even once factors such as publication 
history and training were taken into account (D. K. Ginther et al. 
Science 333, 1015-1019; 2011). 

Initially, the same study also found funding disparities between 
Asians and whites. But when the authors controlled for nativity — 
that is, when they attempted to distinguish Asians who were born 
and educated in the United States from those who had immigrated 
after receiving some or all of their scientific training — they found 
that there was no disparity in funding between whites and Asians who 
were US citizens when they received their PhDs. 


D ata released this week paint a long-term picture of the racial 


Kington says that this finding shows that bias can work in more 
complex ways than along strict racial lines: for instance, in favour of 
native-born and against foreign-born grant applicants. Countering 
this effect requires changes to the peer-review process, but also action 
throughout scientists’ education, training and career trajectories. 

That will need measures to counter negative bias — the tendency 
to have stereotyped or unfavourable opinions about people belonging 

to particular groups — but it will also require 


“Bias can work strategies that take into account positive bias, 
inmore complex — which is the tendency to support, like and 
waysthanalong _ believe in people who are similar to, or have 
strict racial similar experiences to, oneself. Much of 
lines.” the disparity among funded researchers is 


thought to involve factors such as whether 
grant applicants trained at or work at the same institutions, study the 
same research questions and have published as much or in the same 
journals as peer reviewers. 

The NIH is working on some aspects of the issue — for instance, 
its National Research Mentoring Network aims to foster diversity 
through mentoring. Pulmonologist Esteban Burchard and epidemi- 
ologist Sam Oh at the University of California, San Francisco, who 
requested the NIH FOIA data, support ideas that would require NIH 
grant reviewers to score grant applications on factors such as how the 
researchers plan to recruit diverse populations and how well the appli- 
cations reflect the racial and ethnic make-up of the country. Another 
idea is to provide an administrative supplement to funded grants for 
the discovery of racial and ethnic differences in medicine. 

Attacking such disparities — both in the ranks of science itselfand at 
the level of the type of science funded — is a powerful idea. It deserves 
serious consideration as the NIH works to make sure that the research it 
funds is truly representative of the medical needs of the United States. = 


Defensive drives 


Researchers exploring ways to genetically alter 
wild populations are wise to air their plans. 


their work until it is published. But the authors of a paper pub- 

lished in Nature Biotechnology this week have been unusually 

chatty about their work, broadcasting their results over the past year 
and airing their plans for further research. 

Because the work would make it possible to modify the genetics of 

entire populations of organisms, it raises a host of ethical and safety 


() ften, scientists in a fast-moving field try to keep a tight lid on 


questions. The researchers consider it wise to prepare their colleagues 
and the public for the results to come, and to solicit suggestions from 
the community about how to execute such experiments safely. The 
technique in question is an engineered ‘gene drive’ — a system that 
can spread a mutation through a population much faster than normal. 
The practice could wipe out some insect-borne diseases, includ- 
ing malaria. But an accidental release could trigger unintended, 
ecosystem-wide consequences. As such, research that involves gene 
drives must be handled with utmost care. 

The paper, published on 16 November, could ease concerns about 
accidental releases (J. E. DiCarlo et al. Nature Biotechnol. http://doi. 
org/89h; 2015). A team of researchers from Harvard Medical School 
in Boston, Massachusetts, has demonstrated that gene drives can be 
engineered that will work in laboratory strains of yeast (Saccharomyces 
cerevisiae), but that are unlikely to function in wild populations. And 
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if one did escape, the team showed that the mutation can be undone 
by setting loose a second drive to ‘overwrite’ it. The results suggest 
that careful planning can reduce the risks while allowing gene-drive 
research to continue. 

The concept of a gene drive is decades old, but the technique’s 
application was hindered until the discovery a few years ago of a 
simple and versatile genome-editing system called CRISPR-Cas9. 
This system allows researchers to alter genomes with unprece- 
dented precision and to engineer the fundamental components of 
a gene drive that transmit a copy of the edited sequence to nearly 
all offspring. 

And because the CRISPR-Cas9 system is relatively easy to work 
with, the technology is now available to more laboratories than ever 
before. This is both a boon and a concern: it arms more great minds 
with a tool that could address serious public-health and environ- 
mental problems, but it also increases the chance that a laboratory 
might enter the field naive to the necessary safety precautions. 
This has understandably raised some safety concerns, and the US 
National Academies of Sciences, Engineering, and Medicine, for 
example, has convened a committee to evaluate uses of gene drives. 

With this in mind, the Harvard researchers have been careful 
to announce their experimental plans before they carry them out. 
The experiments published this week were alluded to in previous 
publications outlining safety precautions that could be taken. The 
authors gathered feedback from the community, and used this to 
boost the safety of their own experiments. They were also careful 


to develop these safeguards before carrying out key laboratory 
experiments to explore the use of gene drives against Lyme disease, 
which is transmitted to humans through ticks, and schistosomiasis, 
a scourge carried by parasitic worms and most often found in Africa. 
All of these experiments have been discussed openly, before they 

were carried out. 
Such openness is not standard practice. Scientific experiments 
are often subject to approval by institutional safety committees and 
funding agencies, but these discussions tend to 


“Research be carried out behind closed doors. The public is 
involving sometimes surprised by what emerges. Witness 
gene drives the reaction earlier this year, when researchers 
must be announced that they had used CRISPR-Cas9 to 
handled edit the genomes of human embryos (see Nature 
with utmost 520, 593-595; 2015). About three years ago, a 
care.” charged debate and research moratorium ensued 


when news broke that researchers intended to 
publish results showing how they had engineered the H5N1 influenza 
virus to make it more transmissible. 

The Harvard gene-drive researchers have learnt from these 
debacles, and recognized the need to alert the wider community to 
their plans so that discussions can take place, concerns can be aired 
and suggestions offered from all corners before the work is done. 
Scientists should watch closely to see whether this approach could 
serve as a template for other teams that take on the challenge of 
working in controversial fields. m 


Dope rules 


Science is beside the point when an entrenched 
culture in a sport supports scoundrels. 


a bombshell on the athletics world. In a scathing 335-page 

report issued on 9 November, the independent international 
agency alleged the existence of a far-reaching doping programme in 
Russian track and field that implicated government officials, sport- 
ing federations, coaches, athletes, scientists and doctors. Asa result 
of Russia's widespread and institutionalized doping programme, the 
2012 Olympic Games in London were effectively “sabotaged”, WADA 
concluded. 

Russian athletes and coaches have subsequently been suspended 
from international track-and-field competitions by the International 
Association of Athletics Federations, whose laissez-faire policies over 
the years contributed to the scandal, according to WADA. There is 
a real possibility that Russian athletes will be banned from the 2016 
Olympic Games in Rio de Janeiro, Brazil. 

Science, some argue, can lead the way in achieving clean, or at least 
cleaner, sport. In the face of improved tests to detect doping, it will 
become harder for athletes to ingest performance-enhancing drugs 
without getting caught. Many say that the answer might lie with the 
biological passport, which looks for changes in blood chemistry from 
an individual's ‘baseline’ profile that may be indicative of doping. Sus- 
pect blood profiles have been used to nab cheats in professional cycling 
and endurance sports such as biathlons. And there have been claims 
from scientists at cycling’s international federation, the UCI — itself 
subject to allegations of misconduct — that rampant blood doping 
became less common in the pro peloton (the elite professional cycling 
circuit) after biological passports were introduced. 

There is no doubt that science can play a major part in anti-doping 
efforts. But this can only happen once a governance system is in place 
that has a genuine interest in clean sport. A series of doping scandals 


Te month, the World Anti-Doping Agency (WADA) dropped 
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has shown that science is useless at catching cheats in a culture that 
doesn't really want to catch them — and in many cases is being used 
to help them. 

Sophisticated biological passports are futile in a culture that 
encourages a leading anti-doping scientist to destroy blood and urine 
samples by the hundreds, while extorting money from athletes to do 
so. This is what WADAs report alleges of Grigory Rodchenkoy, the 
former head of Russia’s leading anti-doping lab. And the UCI looked 
the other way as Lance Armstrong doped his way to seven successive 
(and now rescinded) Tour de France victories, according to a report 

issued this year by an independent commis- 


“Sports that sion appointed by the UCI. 

haven’t been Sports that haven't been roiled by doping 
roiled by doping scandals may not be looking hard enough. 
scandals may Financial incentives such as corporate 
not be looking sponsorships, broadcasting rights and 


merchandizing have the potential to discour- 
age strong and independent anti-doping 
programmes. Just look at professional cycling, which has been forced 
to confront its massive doping problems. The sport’s popularity has 
suffered, and spectacular performances such as those of Britain’s 
Chris Froome now raise just as many suspicions as celebrations. Just 
imagine if, during an international tennis tournament, a 150-m.p.h. 
serve raised eyebrows rather than awe. 

There will be calls for more and better anti-doping tests in the 
run-up to next year’s summer Olympics in Rio and in other high- 
profile competitions. These are genuinely needed, because dopers tend 
to bea step ahead. Officials will hail their cutting-edge laboratories 
full of gleaming mass spectrometers and haemocytometers, and brag 
about how many urine and blood tests these can process each day — 
never mind that savvy athletes tend to dope out of competition and 
in tiny doses that are nearly impossible to detect. 

If past attitudes are anything to go by, we can expect officials to hide 
behind science, while doing little to root out 
the fundamental problems that allow systemic 
sports doping to thrive. As Russia’s doping 
scandal shows us, it is much easier to change a test 
tube than it is to change a culture. m 


hard enough.” 
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Berkeley, who most memorably defined the role of a university 
administrator: to arrange parking for the staff, sex for the students 
and sports for the alumni. 

Kerr’s throwaway line contained a kernel of truth: university 
administration is a necessary evil. Students and academics are the 
heart and soul of a university, and do its real work. The administrators, 
as his definition implies, merely facilitate. Well, these days, you would 
hardly know it. On campuses across the world, managerialism is 
on the march. The ancient power struggle between academics and 
administrators has lurched decisively in favour of the latter. 

Nowhere is this more true than in the United Kingdom, where 
reforms instigated by former prime minister Margaret Thatcher 30 years 
ago have enabled vice-chancellors to extend their 
grip over matters that were once controlled by 
academics, such as what subjects to teach, which 
kinds of grant to chase and criteria for hiring staff. 
By shifting decision-making to committees domi- 
nated by their close allies, many vice-chancellors 
now operate as if they were chief executives. 

This managerial approach, and the growing 
reach and expanse of administrative staff that has 
accompanied it, is gathering pace worldwide. That 
is largely because British and US universities dom- 
inate international university league tables, and 
many countries’ higher-education policies seek to 
emulate their model. Germany’s Excellence Initia- 
tive, for example, has selected a small number of 
promising institutions and given them the money 
to build up stronger central administrations. 

I am not convinced that any of this is in the 
interests of students, teachers or the wider communities that universities 
are supposed to serve. Iam a UK resident, but share continental suspi- 
cions of league tables, most of which originated in the English-speaking 
world and reflect the strengths of institutions there. At Europe-wide 
meetings, I often empathize with the audience's evident bemusement 
at the self-regard with which British and US speakers comport them- 
selves. I fear that, left unchecked, the high-handed behaviour of some 
vice-chancellors will do to UK universities what the bearers of the same 
corporate outlook have done to our once-respected high-street banks. 
Just as local bank managers are a thing of the past, stubbornly independ- 
ent academics can be flushed out by aggressive administrators, with their 
management speak and freshly minted data on research performance. 

Most of the resistance to these changes has come from the arts and 
humanities. Critics such as literary scholar Stefan 


I t was Clark Kerr, a former chancellor of the University of California, 


Collini have lambasted what they regard asthe NATURE.COM 
destruction of the UK university system in the _ Discuss this article 
London Review of Books and elsewhere. Scientists _ online at: 


have been largely favoured by the new regime, _go.tlafure.com/cozh8d 


GOVERNMENTS 
HAVE NEGLECTED 


DEMOCRACY 
AND ACADEMIC 
FREEDOM IN THEIR 
PUSH TO MAKE 
UNIVERSITIES MORE 


BUSINESSLIKE. 


~ lime tocry out for 
=8 academic freedom 


Giving staff and students a say in how institutions are run would strengthen 
governance and clip the wings of administrators, argues Colin Macilwain. 


and, perhaps asa result, their leaders have been mute on the issue. 

That cannot last. Research funding in the United Kingdom has been 
frozen for five years and, after next week’s government spending review, 
will probably start to decline. About to enter the fray is the proposed 
Teaching Excellence Framework. This is modelled on the Research 
Excellence Framework for evaluating university research, which has 
dominated British higher education since 1986 and has been widely 
adopted overseas. The pressures placed on academics by often-bogus 
metrics continue to mount. 

Here in Scotland, and rather against the tide, an effort is under way 
to give staff and students more influence over how universities are 
run. A higher-education governance bill proposed by the devolved 
government (see go.nature.com/qeqphw) would mandate elections 
of those wishing to chair university governing 
bodies, and would require the inclusion of staff 
and student unions on major committees. The bill 
takes its cue from a 2012 report led by Ferdinand 
von Prondzynski, principal and vice-chancellor 
of Robert Gordon University in Aberdeen, and 
includes measures to guarantee academic free- 
dom under the law (see go.nature.com/jvoma5). 

The modest reforms in the bill are being fought 
tooth and nail by other university principals. Their 
association, Universities Scotland, has worked 
hard to create the impression that academics 
are happy with the current governance arrange- 
ments. It has argued that parts of the bill threaten 
the institutions’ autonomy and would endanger 
their charitable status. It has even won the backing 
of Edward Snowden, the celebrated US whistle- 
blower and student-elected rector of the Univer- 
sity of Glasgow, who has tweeted opposition to the bill from his bunker 
in Moscow. But the proposal merely takes the process that got Snowden 
his position at Glasgow and extends it to other, newer institutions. 

The Scottish National Party government and the opposition Labour 
Party are both likely to support passage of the bill, modified somewhat 
to protect institutions’ autonomy. So universities in this corner of the 
world, at least, are set to become a touch more democratic. Id like to see 
that movement spread to other places, where governments have been 
neglecting the concepts of democracy and academic freedom in their 
relentless push to make universities more businesslike. The cult of the 
chief executive has already permeated almost every corner of our soci- 
ety. But universities are not corporations. They are, by definition, self- 
organized bodies of academics. Student and staff representation on key 
university committees will remind administrators of this fact, call them 
to account — and, in the long run, strengthen university governance. = 


Colin Macilwain writes about science policy from Edinburgh, UK. 
e-mail: cfmworldview@googlemail.com 
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Selections from the 
scientific literature 


RESEARCH HIGHLIGHTS 


Boosting ‘gene 
drive’ safety 


Researchers have developed 
a way to reduce the risks of 
a method that genetically 
engineers entire populations 
with unprecedented speed. 
‘Gene drives are genetic 
changes, based on inserting 
parts of the CRISPR-Cas9 
genome-editing system into 
a host genome, that spread 
through a population more 
rapidly than do normal 
mutations. Gene drives 
could be used to wipe out 
disease-carrying insects, for 
example, but could also spread 
uncontrollably in an ecosystem. 
To reduce this risk, Kevin 
Esvelt and George Church of 
Harvard Medical School in 
Boston, Massachusetts, and 
their team inserted the bacterial 
DNA-cutting Cas9 enzyme 
into a piece of DNA external 
to the Saccharomyces cerevisiae 
yeast genome, and put the 
guide RNAs for directing Cas9 
to a specific DNA sequence 
into the genome. This 
separation ensured that the 
gene drive would not spread 
exponentially if the strain was 
released into the wild. 
Nature Biotech. http://doi. 
org/89h (2015) 


ANIMAL BEHAVIOUR 


Polarized light as 
a secret signal 


Some crustaceans can 
detect polarized light, using 
it as a covert signal that is 


CONSERVATION 


How to save a species 


Common factors such as law enforcement 
contribute to the success or failure of species- 
recovery programmes, suggesting that 
conservation lessons could be generalized 
across different populations or species. 
Jennifer Crees at the Zoological Society 
of London and her colleagues analysed 
48 mammalian conservation programmes, 
ranging from the successful protection of the 
golden lion tamarin (pictured) to the failed 
attempt to save the Yangtze River dolphin. 


invisible to predators. 

Yakir Luc Gagnon at the 
University of Queensland in 
Brisbane, Australia, found that 
the bodies of mantis shrimps 
(Gonodactylaceus falcatus; 
pictured) reflect a distinctive 
pattern of circular polarization 
(pictured in red) that is visible 
only to other shrimps. When 
presented with different 
burrows in the laboratory, 
mantis shrimps avoided 
or delayed entering those 
that were lit with circularly 
polarized light compared with 
those under unpolarized light. 
This suggests that the shrimps 
use polarized light cues to 
sense whether potential 
burrows are occupied. 

In another study, Martin 
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Howat the University of 
Bristol, UK, found that 

male fiddler crabs (Uca 
stenodactylus) detected 
polarized targets in the wild 
from farther away than non- 
polarized ones. The animals’ 
sensitivity to polarized light 
could be boosting the visual 
contrast between crabs and 
their mudflat habitat. 

Curr. Biol. http://doi.org/89c 
(2015); http://doi.org/89d (2015) 


Antibiotics make 
MRSA worse 


Antibiotics could help a 
drug-resistant pathogen to 
worsen inflammation. 


Whereas national-level legislation did not 
necessarily lead to good outcomes, intensity of 
local law enforcement did. Moreover, reducing 
the threats to animals — such as habitat loss and 
hunting — was crucial for long-term survival. 
No link was found between the outcomes of 
recovery programmes and biological factors 
such as body mass and habitat type, suggesting 
that well-designed conservation programmes 
should work across different species. 
Conserv. Biol. http://doi.org/87v (2015) 


Methicillin-resistant 
Staphylococcus aureus (MRSA) 
resists most B-lactam 
antibiotics by acquiring a 
protein that modifies the cell 
wall. David Underhill, George 
Liu and their team at Cedars- 
Sinai Medical Center in Los 
Angeles, California, thought 
that this modification might 
also boost the production of 
inflammatory molecules called 
cytokines in the host. They 
exposed mouse and human 
immune cells to MRSA and 
found that the host cells made 
higher levels of a cytokine 
called IL-18 when MRSA had 
been grown in the presence of 
B-lactams. In mice, treatment 
with a B-lactam caused more 
immune cells to flood the site 


KIKE CALVO/NATIONAL GEOGRAPHIC/GETTY 


WITH PERMISSION FROM ELSEVIER 


DANIEL BELTRA/GREENPEACE 


of an MRSA skin infection, 
resulting in more inflammation 
and larger abscesses than in 
MRSA- infected mice that were 
not treated with the antibiotic. 
MRSA infections are still 
sometimes treated with 
B-lactams, but these should 
be used with other antibiotic 
classes, the authors write. 
Cell Host Microbe 18, 604-612 
(2015) 


MUSCLE BIOLOGY 


Dog saved from 
muscular disease 


A golden retriever with the 
mutation for Duchenne 
muscular dystrophy was found 
to have working muscles 
because of a compensatory 
mutation in another gene. 

The dog, Ringo, was bred by 
researchers to have the mutated. 
version of a protein called 
dystrophin, but he still had 
normal muscles. Louis Kunkel 
at Boston Children’s Hospital 
in Massachusetts, Mayana 
Zatz at the University of Sao 
Paulo in Brazil and their team 
analysed the genomes of Ringo 
and one of his male offspring 
that also had the mutation 
and normal muscles. They 
identified a separate mutation 
in a development gene, Jagged1, 
that resulted in higher levels of 
Jagged 1 in Ringo and his son 
than in 31 affected dogs. 

This mutation may 
compensate for the muscle- 
regeneration problems caused 
bya lack of dystrophin, the 
authors suggest. 

Cell http://doi.org/87s (2015) 


BIOTECHNOLOGY 


Adult cells edited 
and reprogrammed 


A one-step procedure can 
correct genetic mutations in 
body cells and reprogram them 
into stem cells. 

Stem cells derived from 
patients’ tissues could generate 
replacement tissue that is not 
rejected by the immune system. 
Current methods have stem- 
cell yields of only 0.5-0.9%, 
and require extra steps to 
correct any mutations. Sara 


Howden, now at the Murdoch 
Children’s Research Institute 
in Parkville, Australia, and her 
team introduced into cells a 
mix of genes that induce stem- 
cell formation and encode the 
components of the CRISPR- 
Cas9 gene-editing system. 

They targeted mutations in 
cells from two people — an 
adult with a degenerative 
retinal disease and an infant 
with severe combined 
immunodeficiency — and 
made stem cells without the 
defect. The method produced 
stem cells with 5-8% efficiency. 
Stem Cell Rep. http://doi.org/87t 
(2015) 


NATURAL HISTORY 


Selenium linked to 
mass extinctions 


Plummeting ocean reserves of 
selenium could have played a 
part in past mass extinctions. 

Selenium and other trace 
elements help certain enzymes 
to function and perform 
other essential biochemical 
duties in organisms. John 
Long at Flinders University 
in Adelaide, Australia, and 
his team estimated ocean 
selenium levels over the past 
560 million years by analysing 
it in marine pyrite samples. 
Selenium concentrations 
fluctuated drastically, but sharp 
drops coincided with several 
mass extinctions — including 
one at the end of the Triassic 
200 million years ago. 

Crashes in selenium levels 
may have acted in concert 
with changes in oxygen and 
carbon cycles to drive mass 
extinctions, the authors say. 
Gondwana Res. 
http://doi.org/834 (2015) 


Super-thin 
superconductor 


A layer of niobium diselenide 
(NbSe,) just a few atoms thick 
can conduct electricity with 
zero resistance. 

Most 3D superconducting 
materials lose this ability 
once they are in their 2D 
form. Miguel Ugeda at the 


RESEARCH HIGHLIGHTS MiiiSaiiaa¢ 


SOCIAL SELECTION 


Populeripice 


on social media 


How to judge scientists’ strengths 


When the director ofa research institute asked his Twitter 
followers for a practical way to dig out promising candidates 
from the hundreds of applications sitting on his desk, 
they responded in spades. Ewan Birney, co-director of the 
EMBL European Bioinformatics Institute in Hinxton, UK, 
tweeted that he was procrastinating over how to shortlist the 
applications, which together listed around 2,500 research 
papers. (The process is ongoing, so Birney would not say 
exactly what the researchers were applying for.) He tweeted: 
“T get *genuinely* stuck here. If] am not going to use journal 
title as a proxy for quality, what do I do?” 


> NATURE.COM 
For more on 

popular papers: 
go.nature.com/pimBhf 


nanoGUNE research centre in 
San Sebastian, Spain, Michael 
Crommie at the University 
of California, Berkeley, and 
their colleagues studied the 
behaviour of electrons ina 
single layer of NbSe,, grown on 
a bilayer of atom-thick carbon. 
As the team lowered the 
temperature to below -271°C, 
the material's resistance fell to 
zero. The authors say that the 
results confirm that NbSe, is a 
true 2D superconductor, a class 
of materials that could one 
day be used in tiny quantum 
computers and other devices. 
The team also saw ripples 
in electron density while the 
material was superconducting 
— an effect that some theories 
predict should not be possible. 
Nature Phys. http://doi.org/89g 
(2015) 


ENVIRONMENTAL SCIENCE 


Chemicals hinder 
oil-eating microbes 


Chemical dispersants added 
to spilled oil from the 2010 
Deepwater Horizon disaster in 
the Gulf of Mexico (pictured) 
may have made little 
difference to the rates at which 
microbes broke down the oil. 
The dispersants broke up 
the oil into smaller droplets 
to help sea-dwelling microbes 
to degrade it further. To study 
the chemicals’ effect on the 


Yoav Gilad, a geneticist at the University 
of Chicago, Illinois, tweeted: “Read the 
abstracts. Read the papers. Yes, ifit means 
2500 papers, then get a larger committee.” 


microbes, Samantha Joye at 
the University of Georgia in 
Athens and her colleagues 
created bottled mixtures of 
sea water, oil and dispersants 
that simulated environmental 
conditions during the spill. 
Mixtures of oil and sea 
water were dominated by 
Marinobacter species, which 
can degrade a wide range 
of hydrocarbons. But these 
populations dropped when 
dispersant was added, whereas 
Colwellia, which degrades 
dispersants, increased 
in abundance. Adding 
dispersants did not seem 
to change the rate at which 
hydrocarbons were broken 
down in the bottled samples. 
Proc. Natl Acad. Sci. USA 
http://doi.org/89f (2015) 
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SEVEN DAYS nese 


Ocata buyout 
Astellas Pharma in Tokyo 
will pay US$379 million 

for Ocata Therapeutics of 
Marlborough, Massachusetts. 
Formerly called Advanced 
Cell Technology, Ocata has 
struggled financially but 

has continued to develop 
treatments in which human 
embryonic stem cells are 
coaxed into becoming 
retinal cells. The company 
has used the cells to treat 
two types of degenerative 
blindness in small-scale 
clinical trials. In the United 
States, limiting trials to a few 
participants would hamper 
speedy commercialization, 
but Japan has a fast-track 
approval system that allows 
commercialization of stem- 
cell treatments after studies on 
a small number of people. 


Paris talks go ahead 
International climate talks in 
Paris will go ahead despite the 
13 November terrorist attacks 
that killed at least 129 people 
in the French capital. The 
climate conference will be 
held, with tightened security, 
because it is an “essential 
meeting for humanity’, French 
Prime Minister Manuel Valls 


The wind speed in kilometres 
per hour on HD 189733 b, 

a ‘hot Jupiter’ exoplanet 

19.3 parsecs away from 
Earth — and the first weather 
data from a planet outside 
our Solar System. 

Source: Louden, T. & Wheatley, P. J. 
Preprint at http://arxiv.org/ 
abs/1511.03689 (2015). 


Space junk splashes down safely 


A chunk of space debris re-entered Earth's 
atmosphere on 13 November. The fragment 
was too small to hurt anyone but just the right 
size to help scientists to practise tracking an 
incoming asteroid. Researchers on a chartered 
jet filmed the debris, which may have fallen 


said on 14 November. Some 
40,000 participants will gather 
for the United Nations climate 
summit from 30 November 

to 11 December. Almost 

120 government leaders will 
attend the meeting, which it 

is hoped will produce a global 
climate deal. 


Dark-matter hunt 


The world’s most sensitive 
detector for dark matter was 
inaugurated on 11 November 
at the Gran Sasso National 
Laboratory, run by Italy’s 
National Institute for Nuclear 
Physics. Dark matter is thought 
to make up 85% of matter in 
the Universe. The experiment, 
called XENONIT, will monitor 
3.5 tonnes of liquid xenon, to 
try to detect the tiny amount 

of energy that is given off 
when dark matter interacts 
with atoms of ordinary 

matter. The collaboration 
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involves 125 scientists, and the 
experiment is expected to start 
collecting data by the end of 
March 2016. 


Nuclear burial 
Finland's government 
approved the construction of 
a deep underground facility 
to permanently store spent 
nuclear fuel on 12 November. 
Minister of economic affairs 
Olli Rehn said the move was a 
world first. The repository will 
dispose of up to 6,500 tonnes 
of uranium — high-level 
waste produced by nuclear- 
power facilities — by packing 
it into copper canisters and 
burying these in a clay buffer 
400 metres underground. 
The local government has 
already given its go-ahead for 
the facility, which will be on 
Olkiluoto island off Finland’s 
west coast and is due to open 
around 2023. 


offa lunar spacecraft, as it disintegrated 
above the Indian Ocean near Sri Lanka. 
NASA astronomer Peter Jenniskens says the 
successful campaign proves that it is possible 
to gather data about an object targeting the 
planet, even with short notice. 


Pesticide risk 

The world’s most widely 

used herbicide, glyphosate, is 
unlikely to pose a cancer risk to 
humans, according to a report 
published on 12 November 

by the European Food Safety 
Authority (EFSA). In its report, 
the agency set limits on how 
much glyphosate a person may 
safely ingest in a short period 
of time. EFSA’s finding comes 
nearly eight months after the 
World Health Organization's 
International Agency for 
Research on Cancer said that 
glyphosate probably does 
cause cancer in humans. See 
go.nature.com/mb8b4l for 
more. 


Space mining is go 
On 10 November the US 
Senate passed the Space Act of 
2015, allowing US citizens the 
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rights to any materials that 
they gather from asteroids or 
other space-based resources. 
However, space miners will 
also have to comply with the 
1966 Outer Space Treaty, 

an international agreement 
that states: “outer space 

is not subject to national 
appropriation by claim 

of sovereignty”. The act 

also extends the use of the 
International Space Station 
from 2020 to at least 2024. 


Pipeline questions 


Ina letter to transport minister 
Marc Garneau, Canadian 
Prime Minister Justin Trudeau 
on 13 November called for 

a moratorium on crude-oil- 
tanker traffic along the north 
coast of British Columbia. 

The move raises questions 
about a pipeline project by 

the Calgary-based energy- 
delivery firm Enbridge to carry 
oil from Alberta's tar sands 

to the coast for shipment. 
Environmentalists said that a 
moratorium would effectively 
halt the pipeline, but Enbridge 
said that it still hopes to discuss 
the plan with the prime 
minister. 


Reef protected 


Laws passed on 12 November 
in Queensland, Australia, 

will protect the Great Barrier 
Reef (pictured) from port 
development, the state’s 
development minister said. 
The laws ban disposal at sea of 


TREND WATCH 


The number of controversial fish 
aggregating devices (FADs) being 
used in the oceans is rising. Using 
data from tuna-fishing boats 

(see chart), a6 November report 
from the Pew Charitable Trusts 
estimates that between 81,000 and 
121,000 FADs were set adrift in 
2013 — and their use is growing 
(see go.nature.com/ngeubv). 
These FADs are free-floating, so 
fish and other animals shelter 
underneath and become easier to 
catch. But researchers warn that 
the devices encourage overfishing, 
and kill vulnerable species. 


material dredged from ports in rules that the agency released 
the region, and stop any new on 13 November requires 
ports being developed in the farmers to train their workers 
reef World Heritage Area. They in proper hygiene, and to 
form part of commitments test crop-irrigation systems 
made by Australia to safeguard for pathogens, among other 
the reef after the United things. But the regulations 
Nations Educational, Scientific are less stringent than a 2013 
and Cultural Organization FDA proposal that farmers 
considered categorizing found too burdensome. 

the coral zone as ‘at risk. Another of the rules creates a 
Plans to dispose of dredging programme to allow auditors 
material near the reef have to assess imported food and 
proved controversial in recent the overseas facilities that 
years, and conservation produce it. 


groups welcomed last week’s 


legislation, Puno 


Food rules Olive aid 

For the first time, crop The European Commission 
farmers in the United has announced a €7-million 
States will have to answer (US$7.5-million) call for 

to the Food and Drug proposals for research 
Administration (FDA) as into Xylella fastidiosa, the 
part of an effort to prevent aggressive plant pathogen 
food-borne illness. A set of that is destroying swathes 


‘FISH AGGREGATING DEVICE’ USE ON THE RISE 


Floating devices that could kill vulnerable species are being 
used increasingly, despite concerns from researchers and 
environmentalists, an analysis from tuna fishing boats shows. 


Numbers of FADS deployed (thousands) 


a | 
2006 2007 2008 2009 2010 2011 2012 2013 


SEVEN DAYS | THIS WEEK | 


19-20 NOVEMBER 
The inaugural Meta- 
Research Innovation 
Center at Stanford 
(METRICS) conference 
convenes in California 
to discuss how 
biomedical scientists 
plan, conduct and 
communicate research. 
go.nature.com/narepc 


23-27 NOVEMBER 
Ostend, Belgium, hosts 
the European Space 
Weather Week, a forum 
for space-weather 
forecasters and scientists. 
www.stce.be/esww2 


24-26 NOVEMBER 
An international 
immunotherapy 
conference meets in 
Brisbane, Australia. 
go.nature.com/235ing 


of olive trees in the Puglia 
region of southern Italy. The 
call will focus on methods 

of detection and control. 

The outbreak, which has 

also reached some regions of 
France, is a major economic 
threat to the European Union, 
but has received little research 
funding so far. Italian regional 
and national governments 
have also promised €6 million 
for X. fastidiosa research. 


Chile budget boost 


Chile’s Congress was mulling a 
budget increase of 150 million 
pesos (US$210,000) for the 
nation’s research-funding 
agency as Nature went to 
press. The move followed 
street protests by researchers 
after the resignation of 
Francisco Brieva, director of 
the National Commission for 
Scientific and Technological 
Research. The body funds 
more than 3,000 researchers. 
See go.nature.com/pbwtp8 for 
more. 
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Beluga whales are among the species that are thought to use Earth’s weak magnetic field for navigation. 


Long-sought ‘biocompass’ 
discovery claimed 


Protein complex offers explanation for how animals sense Earth’s magnetic pull. 


BY DAVID CYRANOSKI 


say that they have found a biological 

compass needle: a rod-shaped complex 
of proteins that can align with Earth’s weak 
magnetic field. 

The biocompass — whose constituent 
proteins exist in related forms in other species, 
including humans — could explain a long- 
standing puzzle: how animals such as birds 
and insects sense magnetism. It might also 
become an invaluable tool for using magnetic 
fields to control cells, report researchers led by 
biophysicist Xie Can at Peking University in 


E the cells of fruit flies, Chinese scientists 


Beijing, in a paper published on 16 November 
in Nature Materials (S. Qin et al. Nature Mater. 
http://doi.org/89v; 2015). 

“Tt’s an extraordinary paper, says Peter Hore, 
a biochemist at the University of Oxford, UK. 
But Xie’s team has not shown that the complex 
actually behaves as a biocompass inside liv- 
ing cells, nor explained exactly how it senses 
magnetism. “It’s either a very important paper 
or totally wrong. I strongly suspect the latter,” 
says David Keays, a neuroscientist who studies 
magnetoreception at the Institute of Molecular 
Pathology in Vienna. 

Many organisms — ranging from whales 
to butterflies, and termites to pigeons — use 


19 NOVEMBER 2015 | VOL 527 
© 2015 Macmillan Publishers Limited. All rights reserved 


Earth’s magnetic field to navigate or orient 
themselves in space. But the molecular mech- 
anism behind this ability, termed magneto- 
reception, is unclear. 

Some researchers have pointed to magneti- 
cally sensitive proteins called ‘cryptochromes, 
or ‘Cry’ Fruit flies lacking the proteins lose their 
sensitivity to magnetic fields, for example. But 
the Cry proteins alone cannot act as a compass, 
says Xie, because they cannot sense the polarity 
(north-south orientation) of magnetic fields. 

Others have suggested that iron-based 
minerals might be responsible. Magnetite, a 
form of iron oxide, has been found in the beak 
cells of homing pigeons. Yet studies suggest 
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> that magnetite plays no part in pigeon 
magnetoreception. 

Xie says that he has found a protein in fruit 
flies that both binds to iron and interacts with 
Cry. Known as CG8198, it binds iron and 
sulfur atoms and is involved in fruit-fly circa- 
dian rhythms. Together with Cry, it forms a 
nanoscale ‘needle’: a rod-like core of CG8198 
polymers with an outer layer of Cry proteins 
that twists around the core. 

Using an electron microscope, Xie’s team 
saw assemblies of these rods orienting them- 
selves in a weak magnetic field in the same way 
as compass needles. Xie gave CG8198 the new 
name of MagR, for magnetic receptor. 

The discovery offers scientists the prospect 
of using magnetic fields to control cells. Over 
the past decade, scientists have comman- 
deered the light-sensing capacity of some 
proteins to manipulate neurons, usually by 
inserting a fibre-optic cable directly into the 
brain — a tool called optogenetics. But mag- 
netosensing proteins have the advantage that 
they could be manipulated by magnetic fields 
outside the brain. 

Zhang Sheng-jia, a neuroscientist at Tsin- 
ghua University in Beijing, claims to have 
already demonstrated this ‘magnetogenetic’ 
capability. In September, he provided a 


surprise preview of Xie’s work when he pub- 
lished a paper reporting use of the biocompass 
to manipulate neurons in worms (X. Long et al. 
Sci. Bull. http://doi.org/883; 2015). Xie and 
others complained that Zhang’s early publi- 
cation violated a collaboration agreement 
between the two researchers — the details of 
which are disputed — and asked for it to be 

retracted. In October, 


“Tf MagR is the Zhang was fired from 
real magneto- his university, a deci- 
receptor, I'll sion that he is contest- 


eat my hat.” ing (see Nature http:// 
doi.org/882; 2015). 

Xie says that in April, he submitted a Chinese 
patent application that includes the use of 
magnetogenetics and the protein’s magnetic 
capacity to manipulate large molecules. He is 
also starting to look at the structure of MagR 
proteins in other animals, including humans. 
Variants in the human version of MagR might 
even relate to differences in people's sense of 
direction, he suggests. 


SCEPTICAL VOICES 

Other scientists are not convinced that the 
biological needles function like compasses 
in living organisms. Xie’s team has shown 
that MagR and Cry are produced in the same 


cells in pigeon retinas — the birds’ proposed 
magnetoreception centre — but MagR and Cry 
are found in many cells, says Keays. “With such 
a small amount of iron, one has to ask whether 
in vivo, at physiological temperatures, MagR is 
capable of possessing magnetic properties at all,” 
he says. “If MagR is the real magnetoreceptor, 
Tl eat my hat” 

Xie hopes that others will strengthen his case 
with further experiments, such as inactivating 
the gene for MagR in certain fruit-fly tissues to 
see whether it affects the animals’ sense of direc- 
tion. He published without doing this work, he 
says, because he just wanted to report the find- 
ings, which he has been working on for six years. 

The lack of an exact mechanism for how the 
protein complex senses magnetism, or how 
any signal it sends might be processed by the 
brain, gives some researchers pause. MagR’s 
biocompass activity might simply be the result 
of experimental contamination, says Michael 
Winklhofer, a magnetism specialist and Earth 
scientist at Ludwig Maximilian University of 
Munich in Germany. He is planning experi- 
ments to follow up on Xie’s team’s findings. If 
it holds up, says Winklhofer, then the discovery 
of MagR “appears to be a major step forward 
towards unravelling the molecular basis of 
magnetoreception’. m 


Space test for long-awaited 
sravitational-wave detector 


Europe’s LISA Pathfinder spacecraft has two metal cubes at its heart, which it will attempt 
to isolate from every force except for gravity. 


BY ELIZABETH GIBNEY 


here is a lot riding on the LISA Pathfinder 
"Tinision, an ambitious effort to test 

whether intricate technology designed 
to detect ripples in space-time can be deployed 
in space. 

Scheduled to launch on 2 December, the 
spacecraft is a long-awaited test-drive for 
a future €1-billion (US$1.1-billion) space 
observatory planned by the European Space 
Agency (ESA). The follow-up mission would 
track the largest objects in the Universe, includ- 
ing mergers between supermassive black holes 
and collisions between galaxies, by the space- 
time ripples that they create. 

First predicted by Albert Einstein almost 
exactly 100 years ago as part of his general 
theory of relativity (see nature.com/ 


relativity100), such gravitational waves have 
never been observed directly — let alone used 
to study the cosmos. There are already Earth- 
based observatories hunting these waves, but 
a space-based one would search for waves at 
the opposite end of the spectrum (see Nature 
525, 301-302; 2015). “It’s like having a radio 
telescope as well as an optical one,” says Karsten 
Danzmann, director of the Max Planck Institute 
for Gravitational Physics in Hanover, Germany, 
and co-principal investigator for the Pathfinder 
mission. “The part of the Universe you see is 
completely different.” 

The final space-based observatory will try to 
spot the stretching and compressing of space 
by bouncing laser beams between three masses 
floating in freefall, each separated from the 
others by some 5 million kilometres. Because 
the masses would be protected from all other 
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external forces, only a gravitational wave should 
disrupt the synchrony of their falling motion — 
a disturbance that would affect laser frequency. 

The LISA Pathfinder (named after the 
Laser Interferometer Space Antenna, the 
concept behind the gravitational-wave obser- 
vatory) is a smaller-scale test of this ultimate 
plan. With a pricetag of €400 million, it uses 
just two masses — each a 2-kilogram cube 
of gold and platinum — separated by a mere 
38 centimetres, which allows them to fit inside 
the same spacecraft. 

Unlike that of the observatory that it is 
designed to test-drive, this set-up is not sen- 
sitive enough to detect gravitational waves — 
instead, its purpose is to show that the masses 
can be completely isolated, and that any devia- 
tions in their relative motion can be measured 
with picometre accuracy. “We're missing out 


SOURCE: ESA/ATG MEDIALAB 


S. SHAPIRO, S. CONNELLY, 
A. KHAN & L. KONG, UIUC 


the 5 million kilometres, but so what?” says 
Paul McNamara, the mission's project scientist. 
“Pretty much everything that could affect our 
ability to measure gravitational waves is here” 

From the time of Pathfinder’s launch from 
ESA’s spaceport in Kourou, French Guiana, to 
the end of its subsequent eight-week journey, 
the masses will stay pinned to their housing 
deep inside the craft. But on arrival in orbit 
around a stable point between the Sun and 
Earth called Lagrange point 1, or L-1, about 
1.5 million kilometres away, the cubes will be 
gently released to float within the spacecraft 
(see ‘Precision lab in space’). 

Once in freefall, “the challenge is to isolate 
this little cube from everything around it, 
so the only thing it sees is space-time’, says 
McNamara. Expected disturbances are pres- 
sure from solar radiation and stray magnetic 
fields; the equipment is so precise that it should 
detect even a force equal to the weight of a 
small bacterium on Earth. 

As a high-precision laboratory in space, the 
LISA Pathfinder is unlike anything that ESA 
has done before, says Tim Sumner, an astro- 
physicist at Imperial College London who led 
the team that constructed one of the craft’s 
protection mechanisms. 

Another unusual element is that the major 
cargo — the cubes — will define the craft’s tra- 
jectory, rather than vice versa. As they orbit 
around L-1 and fall in microgravity, Pathfinder 
will deploy microthrusters that are so gentle, 
it would take around 1,000 to lift a piece of 
paper on Earth. The thrusters will monitor the 
cubes positions, ensuring that the craft hovers 
around the cubes without letting them touch 
its sides. Such a set-up required the teams who 
built the instruments and the engineers who 
made the craft to work together to an unprec- 
edented degree, says Sumner. 

These complexities go a long way towards 
explaining why the launch has taken so long 
to orchestrate, says Stefano Vitale, a physicist 
at the University of Trento in Italy, and a prin- 
cipal investigator for the Pathfinder mission; 
Pathfinder was approved by ESA in 2000 and 
originally intended for launch in 2006 (see 
Nature 469, 280; 2011). “Coarsely speaking, I 
think people underestimated the difficulty,’ says 
Vitale. “But that’s why you have a Pathfinder” 

The final step in the planned mission will 
test Pathfinder’s limits by instructing onboard 
instruments to tweak the internal temperature 


PRECISION LAB IN SPACE 


IN FOCUS 


LISA Pathfinder aims to test whether an intricate experiment consisting of two metal cubes in freefall, 
isolated from all forces except gravity, can operate in space. 


When Pathfinder launches, clamps pin the 
cubes — which are buried at the heart of the 
craft — tightly to their housing so that they 
don’t jostle and damage either themselves or 
other instruments. 


J (ex 


Two hours after launch, Pathfinder 
separates from the launcher and begins 
to make increasingly elongated ellipses. 


Pex 


Once the craft is stable, clamps release 
the cubes extremely gently; retractable 
devices position each one exactly at the 
centre of its housing at a speed of fewer 
than 5 micrometres per second. (See below). 


© 


Around 55 days into the mission, 
craft arrives in orbit around L-1. 


51 days after launch, Pathfinder 
separates from thrusters. 


Nine days after launch, Pathfinder makes a final 
burn, propelling it towards its destination — the 
stable point L-1, 1.5 million kilometres from Earth. 


At the heart of Pathfinder are 
two freefalling metal cubes, 
shielded from all forces except 
gravity by their housing. 


The housing monitors each 
cube’s position and commands 
the craft to move so that the 
cube is always at its centre. 


and magnetic and electrostatic fields to see how 

such changes affect the cubes. “We want to learn 

everything we can about the physics ofa free- 

floating body, and everything we learn will feed 

back into design of the future mission,” says 
McNamara. 

However, some 

opportunistic ESA 

scientists are already 

thinking about 

how Pathfinder’s 

instruments could 

be used to inform 

other problems 

once its main mission, which could take up to 

a year, is complete. Measurement of the gravi- 

tational constant, known as Big G, for exam- 

ple, should fall naturally out of Pathfinder’s 

data, Sumner says. Because the true value of 

Big G is disputed, a fresh measurement from 


Any disturbance to the 
relative motion of the 
cubes affects the 
frequency of the laser 
bouncing between them. 


The cubes float in 

a vacuum, surrounded 
by instruments that 
mitigate stray forces. 


space would provide useful perspective. 

To get a higher precision measurement, 
ESA may also consider extending the mis- 
sion — although Sumner says that scientists 
would only request this after Pathfinder has 
proved itself, a few months in. He and his col- 
leagues have also discussed using the craft’s 
thrusters to send it to a spot known as a saddle 
point, where the gravitational pulls of Earth 
and the Sun cancel each other out. This could 
reveal how gravity behaves at its lowest level 
possible in the Solar System, with little extra 
cost. Few scientists doubt that Einstein’s theo- 
ries hold, says Sumner, but it would be interest- 
ing to do the test nonetheless. 

Vitale, however, points out that itis important 
for researchers to stay focused on the mission's 
immediate goal. “Our main objective is to dem- 
onstrate freefall? he says, “and we don’t want to 
be distracted from that? m= 
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Satellites 
test general 
relativity 


Wayward craft find new use. 


BY ELIZABETH GIBNEY 


r Iwo satellites that were accidentally 
launched into the wrong orbit will be 
repurposed in the most stringent test 

yet of a prediction made by Albert Einstein’s 

general theory of relativity — that clocks run 
more slowly the closer they are to heavy objects. 

The satellites, operated by the European 
Space Agency (ESA), were mislaunched last 
year by a Russian Soyuz rocket that put them 
into elliptical, rather than circular, orbits. 
This left them unfit for their intended use as 
part of a European global-navigation system 
called Galileo. But the two craft have atomic 
clocks on board. According to general relativ- 
ity, the clocks’ ticking should slow down as the 
satellites move closer to Earth in their orbits, 
because the planet’s gravity bends the fabric 
of space-time. 

On 9 November, ESA announced that teams 
at Germany’s Center of Applied Space Tech- 
nology and Microgravity in Bremen and at the 
department of Time-Space Reference Systems 
at the Paris Observatory will track this accelera- 
tion and deceleration. By comparing the speed 
of the clocks’ ticking with the crafts’ known alti- 
tudes — pinpointed by lasers from monitoring 
stations on the ground — the researchers can 
test the accuracy of Einstein’s theory. 

In 1976, NASA launched an atomic clock 
aboard Gravity Probe A from Earth’s surface, 
sending it 10,000 kilometres into space, to 
compare its ticking with that of an identical 
clock on the ground. But that probe stayed in 
the air for just shy of two hours. The Galileo 
satellites, by contrast, will conduct experi- 
ments for a year. 

ESA expects the results to be four times more 
accurate than those of Gravity Probe A — ena- 
bling the agency to test whether theory agrees 
with reality to a precision of below 0.004%. 

No one expects Einstein's theory, which was 
published almost 100 years ago (see nature.com/ 
relativity100), to break down — it has passed 
every test thrown at it. A future ESA experiment 
called the Atomic Clock Ensemble in Space, or 
ACES, is scheduled to fly on the International 
Space Station in 2017. ACES will push Einstein's 
theory to even greater limits, testing it with a 
precision that could reach 0.0002%. m 
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Minority researchers in the United States consistently win NIH funding at lower rates than their peers. 


Racial bias haunts 
NIH grants 


Minorities are still less likely to get biomedical funding. 


BY ERIKA CHECK HAYDEN 


inority scientists are less likely 
M than their peers to have biomedi- 
cal research grants funded — and 
the disparity has barely changed in 30 years, 
according to data from the US National 
Institutes of Health (NIH). The numbers, 
requested by two California researchers to 
reignite discussion about diversity in the 
scientific workforce, show no consistent 
improvement, even though the proportion 
of minority grant reviewers has climbed. 
Pulmonologist Esteban Burchard and 
epidemiologist Sam Oh of the University of 
California, San Francisco, shared the data 
with Nature after obtaining them from the 
NIH through a request under the Freedom 
of Information Act. The figures show that 
under-represented minorities have been 
awarded NIH grants at 78-90% the rate of 
white and mixed-race applicants every year 
from 1985 to 2013 (see ‘Persistent gap’). 
Burchard and Oh had hypothesized 
that they might see an increase in funding 
for under-represented minorities after a 
1994 NIH mandate that investigators must 
include women and minorities in clinical 
studies. They reasoned that the increased 
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emphasis on minority health would create a 
virtuous circle by boosting grants to minority 
researchers, who, they posit, would be more 
likely to focus on those groups and help to 
fulfil the mandate. 

But there seems to have been no such 
increase. Asa result, Burchard and Oh worry 
that a racial divide could develop between 
researchers and the people they study. 
Burchard notes, for example, that a lack of 
diversity among trial participants may have 
caused problems for two drug companies 
that produced an anticlotting drug that had 
reduced efficacy in East Asians and Pacific 
Islanders. The attorney-general of Hawaii 
filed a lawsuit against the companies last year 
for failing to disclose the issue. “It’s a public- 
health problem, he says. 

In acommentary in PLoS Medicine next 
month, Burchard and Oh will argue that sci- 
entific workforce diversity helps to ensure that 
research addresses issues relevant to all. 

Other researchers say that airing the new 
data serves a purpose. “It raises the question 
in all of us as to the root causes of these dis- 
parities,” says David Burgess, a cell biologist 
at Boston College in Chestnut Hill, Massa- 
chusetts, who is lead principal investigator of 
the National Research Mentoring Network, 
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a US$19-million initiative announced by the 
NIH in October 2014 to improve mentorship 
of scientists from under-represented groups. 

But the data offer no clues to such questions, 
counters Raynard Kington, president of 
Grinnell College in lowa and former deputy 
director of the NIH. “It’s not surprising, not new, 
and doesn’t answer questions of how we can 
intervene to give every scientist the opportunity 
to contribute,” he says. In 2011, he co-authored 
a paper that found that black applicants for 
NIH funding were about two-thirds as likely as 
white people to receive grants during the years 
2000-06, even accounting for factors such as 
publication record and training (D. K. Ginther 
et al. Science 333, 1015-1019; 2011). 

He and others point to evidence that funding 
can be influenced by personal bias. In February, 
researchers who analysed nearly 19,000 North 
American faculty hiring decisions in computer 
science, business and history reported that elite 
institutions predominantly hire people who 
earn their doctorates from the same or other 
elite schools. One-quarter of the 461 institutions 
surveyed had trained 71-86% of tenure-track 
faculty, depending on the discipline (A. Clauset 
et al. Sci. Adv. 1, e€1400005; 2015). 

Such studies hold evidence of what biologist 
Margaret Werner-Washburne of the University 
of New Mexico in Albuquerque calls “positive 
bias” or in-group bias: the tendency for people 


PERSISTENT GAP 


Since 1985, the chances of winning a US National 
Institutes of Health grant have fallen overall, but the 
success rate for under-represented minorities has 
stayed below that of other races. 
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to favour other people and institutions that they 
know either personally or by reputation. 

“I think this happens a lot in the granting 
world,” she says. “Is it that when you're on a 
panel and you have to rate 15 grants from the 
top people in the field who have really pro- 
duced a lot, from major schools, you just want 
to root for them because their skill and poten- 
tial is so apparent, versus someone who isn't 
from that world?” 

Cardiologist Hannah Valantine, who became 
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the NIH’s first chief officer for scientific-work- 
force diversity in 2014, says that the agency 
is focused on demonstrating the benefits of 
diversity and how to achieve it. She adds that 
in response to Kington’s 2011 paper, the NIH 
has allocated more than $500 million to pro- 
grammes to evaluate how to attract, mentor and 
retain minority researchers. The agency is also 
studying biases that might affect peer review, 
and is interested in gathering data on whether a 
diverse workforce improves science. Although 
diversity benefits businesses and individual sci- 
entific investigators, it has not been shown to 
broaden the scope of research, says Valantine. 

“We can move forward with a premise 
that the diversity of scientists themselves is 
important,” she says. “But it behooves us as 
scientists to get the evidence that the diver- 
sity of scientists makes a difference to the 
output.” m SEE EDITORIALP.275 


CLARIFICATION 

The News story ‘Mega science prize split 
between more than 1,000 physicists’ (Nature 
527, 145; 2015) did not reflect Goran 
Hansson’s current role in the Nobel system. 
He is secretary-general of the Swedish 
Academy of Sciences, the body that awards 
the Nobel prizes in physics and chemistry. 
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EXOPLANETS 
THE NEXT 20 YEARS 


Researchers have found nearly 2,000 worlds 
beyond our Solar System. Now they hope to 
understand them. 


BY ALEXANDRA WITZE 
DESIGN BY JASIEK UKRZYSZTORIAK 


Twenty years ago this month, 
astronomers announced the discovery 
of 51 Pegasi b, the first confirmed 
planet orbiting a Sun-like star. The 
hellish gas giant orbits just beyond the 
searing heat of its parent star, and it 
opened astronomers’ eyes to the 
astonishing range of alien worlds 
that exist throughout the Galaxy. 
The tally of known extrasolar 
planets now stands at 1,978, with 
nearly 4,700 more candidates 
waiting to be confirmed. On 
29 November, exoplanet 
researchers will gather in Hawaii 
to review these extreme solar 
systems — and map out a path 
for the next two decades. 


The search so far 


By far the greatest haul of exoplanets has come from NASA's Kepler spacecraft 
(pictured above), which for four years stared at a small patch of the night sky in 
search of stars that dim temporarily as planets cross their faces. The main Kepler 
mission ended in 2013, but planet hunting continues in a revamped ‘K2’ mission. 


Kepler's field of view covers only about 1/400 of the night sky. 
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measurements of 
characteristics such as mass, 
temperature and atmospheric 
composition. 


NEXT-GENERATION 
TRANSIT SURVEY 

An ongoing project to 
search for exoplanets in 
Southern Hemisphere skies. 


TRANSITING EXOPLANET 
SURVEY SATELLITE 


The spacecraft, set to launch 
in 2017, will search for rocky 
worlds around nearby bright 
stars. Astronomers can then 
follow up the finds using 
ground-based telescopes. 


Transiting Exoplanet 
Survey Satellite 


JAMES WEBB 

SPACE TELESCOPE 

Targeted for a 2018 launch, 
the telescope will measure 
planetary atmospheres in 
infrared wavelengths to probe 
their chemical compositions. 


James Webb 
Space Telescope 


PLATO 


The space observatory, set to 
begin operating in 2024, will 
search for Earth-like worlds in 
the habitable zones of up to 
1 million stars. 


HABITABILITY PLOT: R. BARNES, V. S. MEADOWS & N. EVANS ASTROPHYS. J. (IN THE PRESS); PREPRINT AT HTTP://ARXIV.ORG/ABS/1509.08922 


SPECTRA: D. DEMING ET AL. ASTROPHYS. J. 774, 95-112 (2013), CC-BY-3.0. 


Starlight 


Ne ee o 
Cloudy Slim Clear 
atmosphere atmosphere atmosphere 


Planet 


Chemical analyses of how the starlight is absorbed 
reveals compounds such as water in the cloudy skies of 
distant exoplanets. 


-— Model spectra with more haze 
— Model spectra with less haze 
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Kepler-22 b 


Habitable zone 


So, is there life? 


Maybe. Now the question is how to decide which of the potentially thousands of exoplanets to 
pursue further. Researchers recently devised a ‘habitability index’ that shows which planets are 
most likely to have liquid water on their surface. The index can be compared against other 
measures — such as the amount of starlight received by the planet — to explore which planets 
might be worth targeting first for searches for extraterrestrial life. 
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NEWS 


ENTANGLEMENT 


Many physicists believe 
that entanglement is 

the essence of quantum 
weirdness — and some now 
suspect that it may also be 
the essence of space-time. 


BY RON COWEN 


290 | 


of his first sabbatical from teaching, Mark 

Van Raamsdonk decided to tackle one of 
the deepest mysteries in physics: the relation- 
ship between quantum mechanics and gravity. 
After a year of work and consultation with col- 
leagues, he submitted a paper on the topic to 
the Journal of High Energy Physics. 

In April 2010, the journal sent him a rejec- 
tion — with a referee's report implying that 
Van Raamsdonk, a physicist at the University of 
British Columbia in Vancouver, was a crackpot. 

His next submission, to General Relativity 
and Gravitation, fared little better: the referee’s 
report was scathing, and the journal’s editor 
asked for a complete rewrite. 

But by then, Van Raamsdonk had entered a 
shorter version of the paper into a prestigious 


E early 2009, determined to make the most 
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annual essay contest run by the Gravity 
Research Foundation in Wellesley, Massachu- 
setts. Not only did he win first prize, but he also 
got to savour a particularly satisfying irony: the 
honour included guaranteed publication in 
General Relativity and Gravitation. The journal 
published the shorter essay’ in June 2010. 

Still, the editors had good reason to be 
cautious. A successful unification of quantum 
mechanics and gravity has eluded physicists 
for nearly a century. Quantum mechanics gov- 
erns the world of the small — the weird realm 
in which an atom or particle can be in many 
places at the same time, and can simultaneously 
spin both clockwise and anticlockwise. Gravity 
governs the Universe at large — from the fall 
of an apple to the motion of planets, stars and 
galaxies — and is described by Albert Einstein’s 


PARAMOUNT PICTURES 


WARNER BROS. ENTERTAINMENT/ 


general theory of relativity, announced 100 years 
ago this month. The theory holds that gravity is 
geometry: particles are deflected when they pass 
near a massive object not because they feel a 
force, said Einstein, but because space and time 
around the object are curved. 

Both theories have been abundantly verified 
through experiment, yet the realities they 
describe seem utterly incompatible. And from 
the editors’ standpoint, Van Raamsdonk’s 
approach to resolving this incompatibility was 
strange. All that’s needed, he asserted, is ‘entan- 
glement’: the phenomenon that many physicists 
believe to be the ultimate in quantum weirdness. 
Entanglement lets the measurement of one 
particle instantaneously determine the state of 
a partner particle, no matter how far away it may 
be — even on the other side of the Milky Way. 


Black holes such as the one depicted in /nterstellar 
(2014) can be connected by wormholes. 


Einstein loathed the idea of entanglement, 
and famously derided it as “spooky action at a 
distance”. But it is central to quantum theory. 
And Van Raamsdonk, drawing on work by 
like-minded physicists going back more than 
a decade, argued for the ultimate irony — that, 
despite Einstein’s objections, entanglement 
might be the basis of geometry, and thus of 
Einstein’s geometric theory of gravity. “Space- 
time,” he says, “is just a geometrical picture of 
how stuff in the quantum system is entangled.” 

This idea is a long way from being proved, 
and is hardly a complete theory of quantum 
gravity. But independent studies have reached 
much the same conclusion, drawing intense 
interest from major theorists. A small indus- 
try of physicists is now working to expand the 
geometry-—entanglement relationship, using all 
the modern tools developed for quantum com- 
puting and quantum information theory. 

“I would not hesitate for a minute,” says 
physicist Bartlomiej Czech of Stanford Uni- 
versity in California, “to call the connections 
between quantum theory and gravity that have 
emerged in the last ten years revolutionary.” 


GRAVITY WITHOUT GRAVITY 

Much of this work rests on a discovery” 
announced in 1997 by physicist Juan 
Maldacena, now at the Institute for Advanced 
Study in Princeton, New Jersey. Maldacena’s 
research had led him to consider the relation- 
ship between two seemingly different model 
universes. One is a cosmos similar to our own. 
Although it neither expands nor contracts, it 
has three dimensions, is filled with quantum 
particles and obeys Einstein's equations of grav- 
ity. Known as anti-de Sitter space (AdsS), it is 
commonly referred to as the bulk. The other 
model is also filled with elementary particles, 
but it has one dimension fewer and doesn't 
recognize gravity. Commonly known as the 
boundary, it isa mathematically defined mem- 
brane that lies an infinite distance from any 
given point in the bulk, yet completely encloses 
it, much like the 2D surface of a balloon enclos- 
ing a 3D volume of air. The boundary particles 
obey the equations ofa quantum system known 
as conformal field theory (CFT). 

Maldacena discovered that the boundary 
and the bulk are completely equivalent. Like 
the 2D circuitry of a computer chip that 
encodes the 3D imagery of a computer game, 
the relatively simple, gravity-free equations 
that prevail on the boundary contain the same 
information and describe the same physics as 
the more complex equations that rule the bulk. 

“It's kind of a miraculous thing,’ says Van 
Raamsdonk. Suddenly, he says, Maldacena’s 
duality gave physicists a way to think about 
quantum gravity in the bulk without thinking 
about gravity at all: they just had to look at the 
equivalent quantum state on the boundary. 


FEATURE | NEWS 


And in the years since, so many have rushed to 
explore this idea that Maldacena’s paper is now 
one of the most highly cited articles in physics. 

Among the enthusiasts was Van Raams- 
donk, who started his sabbatical by pondering 
one of the central unsolved questions posed 
by Maldacena’s discovery: exactly how does a 
quantum field on the boundary produce grav- 
ity in the bulk? There had already been hints’ 
that the answer might involve some sort of 
relation between geometry and entanglement. 
But it was unclear how significant these hints 
were: all the earlier work on this idea had dealt 


1 HAD UNDERSTOOD 
SOMETHING THAT 
NO ONE HAD 
UNDERSTOOD 
BEFORE.” 


with special cases, such as a bulk universe that 
contained a black hole. So Van Raamsdonk 
decided to settle the matter, and work out 
whether the relationship was true in general, 
or was just a mathematical oddity. 

He first considered an empty bulk universe, 
which corresponded to a single quantum field 
on the boundary. This field, and the quan- 
tum relationships that tied various parts of it 
together, contained the only entanglement in 
the system. But now, Van Raamsdonk won- 
dered, what would happen to the bulk universe 
if that boundary entanglement were removed? 

He was able to answer that question using 
mathematical tools* introduced in 2006 by 
Shinsei Ryu, now at the University of Illinois 
at Urbana-Champaign, and Tadashi Takanagi, 
now at the Yukawa Institute for Theoretical 
Physics at Kyoto University in Japan. Their 
equations allowed him to model a slow and 
methodical reduction in the boundary field’s 
entanglement, and to watch the response in 
the bulk, where he saw space-time steadily 
elongating and pulling apart (see “The entan- 
glement connection’). Ultimately, he found, 
reducing the entanglement to zero would 
break the space-time into disjointed chunks, 
like chewing gum stretched too far. 

The geometry—entanglement relationship 
was general, Van Raamsdonk realized. Entan- 
glement is the essential ingredient that knits 
space-time together into a smooth whole — not 
just in exotic cases with black holes, but always. 

“T felt that I had understood something 
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The ghostly quantum phenomenon of entanglement may be what knits space-time into a smooth whole. 


In an infinite model universe known as 
anti-de Sitter space, the effects of gravity at 
any point x in the interior are mathematically 
equivalent to a quantum field theory on its 
boundary. This universe can be visualized in 
2D by filling it with imaginary triangles. 
Although the triangles are identical, they look 
increasingly distorted as they approach the 
boundary. 


Physicists noticed that this pattern 
resembled diagrams called tensor 
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In 1935, Albert Einstein, Boris Podolsky and Nathan Rosen (EPR) pointed out that a connection can exist 
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The bulk-boundary correspondence 
implies that space on the inside is built from 
quantum entanglement around the outside. 
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Also in 1935, Einstein and 
Rosen (ER) showed that widely 
separated black holes can be 
connected by a tunnel through 
space-time now often known 
as a wormhole. 


Physicists suspect that the 
connection in a wormhole and the 
connection in quantum entanglement 
are the same thing, just ona 
vastly different scale. 

Aside from their size there is 

no fundamental difference. 1 


Quantum 
entanglement 


Particle 
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about a fundamental question that perhaps 
nobody had understood before,” he recalls: 
“Essentially, what is space-time?” 


ENTANGLEMENT AND EINSTEIN 

Quantum entanglement as geometric glue — 
this was the essence of Van Raamsdonk’s 
rejected paper and winning essay, and an idea 
that has increasingly resonated among physi- 
cists. No one has yet found a rigorous proof, so 
the idea still ranks as a conjecture. But many 
independent lines of reasoning support it. 

In 2013, for example Maldacena and Leonard 
Susskind of Stanford published’ a related con- 
jecture that they dubbed ER= EPR, in honour 
of two landmark papers from 1935. ER, by Ein- 
stein and American-Israeli physicist Nathan 
Rosen, introduced’ what is now called a worm- 
hole: a tunnel through space-time connecting 
two black holes. (No real particle could actually 
travel through such a wormhole, science-fic- 
tion films notwithstanding: that would require 
moving faster than light, which is impossible.) 
EPR, by Einstein, Rosen and American physi- 
cist Boris Podolsky, was the first paper to clearly 
articulate what is now called entanglement’. 

Maldacena and Susskind’s conjecture was 
that these two concepts are related by more 
than a common publication date. If any two 
particles are connected by entanglement, the 
physicists suggested, then they are effectively 
joined by a wormhole. And vice versa: the 
connection that physicists call a wormhole is 
equivalent to entanglement. They are different 
ways of describing the same underlying reality. 

No one has a clear idea of what this under- 
lying reality is. But physicists are increasingly 
convinced that it must exist. Maldacena, 
Susskind and others have been testing the 
ER=EPR hypothesis to see ifit is mathemati- 
cally consistent with everything else that is 
known about entanglement and wormholes — 
and so far, the answer is yes. 


HIDDEN CONNECTIONS 

Other lines of support for the geometry- 
entanglement relationship have come from 
condensed-matter physics and quantum infor- 
mation theory: fields in which entanglement 
already plays a central part. This has allowed 
researchers from these disciplines to attack 
quantum gravity with a whole array of fresh 
concepts and mathematical tools. 

Tensor networks, for example, are a technique 
developed by condensed-matter physicists to 
track the quantum states of huge numbers of 
subatomic particles. Brian Swingle was using 
them in this way in 2007, when he was a gradu- 
ate student at the Massachusetts Institute of 
Technology (MIT) in Cambridge, calculating 
how groups of electrons interact in a solid mat- 
erial. He found that the most useful network for 
this purpose started by linking adjacent pairs 
of electrons, which are most likely to interact 
with each other, then linking larger and larger 
groups in a pattern that resembled the hierarchy 
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of a family tree. But then, during a course in 
quantum field theory, Swingle learned about 
Maldacena’s bulk—-boundary correspondence 
and noticed an intriguing pattern: the mapping 
between the bulk and the boundary showed 
exactly the same tree-like network. 

Swingle wondered whether this resemblance 
might be more than just coincidence. And in 
2012, he published® calculations showing that 
it was: he had independently reached much the 
same conclusion as Van Raamsdonk, thereby 
adding strong support to the geometry-entan- 
glement idea. “You can think of space as being 
built from entanglement in this very precise way 
using the tensors,” says Swingle, who is now at 
Stanford and has seen tensor networks become 
a frequently used tool to explore the geometry- 
entanglement correspondence. 

Another prime example of cross-fertilization 
is the theory of quantum error-correcting codes, 
which physicists invented to aid the construc- 
tion of quantum computers. These machines 
encode information not in bits but in ‘qubits’: 
quantum states, such as the up or down spin of 
an electron, that can take on values of 1 and 0 
simultaneously. In principle, when the qubits 
interact and become entangled in the right 
way, such a device could perform calculations 
that an ordinary computer could not finish in 
the lifetime of the Universe. But in practice, the 
process can be incredibly fragile: the slightest 
disturbance from the outside world will disrupt 
the qubits’ delicate entanglement and destroy 
any possibility of quantum computation. 

That need inspired quantum error-correcting 
codes, numerical strategies that repair cor- 
rupted correlations between the qubits and 
make the computation more robust. One 
hallmark of these codes is that they are always 
‘non-local’: the information needed to restore 
any given qubit has to be spread out over a wide 
region of space. Otherwise, damage in a single 
spot could destroy any hope of recovery. And 
that non-locality, in turn, accounts for the fas- 
cination that many quantum information theo- 
rists feel when they first encounter Maldacena’s 
bulk-boundary correspondence: it shows a very 
similar kind of non-locality. The information 
that corresponds to a small region of the bulk is 
spread over a vast region of the boundary. 

“Anyone could look at AdS-CFT and say 
that it’s sort of vaguely analogous to a quantum 
error-correcting code,’ says Scott Aaronson, a 
computer scientist at MIT. But in work pub- 
lished in June’, physicists led by Daniel Harlow 
at Harvard University in Cambridge and John 
Preskill of the California Institute of Technol- 
ogy in Pasadena argue for something stronger: 
that the Maldacena duality is itselfa quantum 
error-correcting code. They have demonstrated 
that this is mathemati- 
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saying for years that entanglement is somehow 
important for the emergence of the bulk,” says 
Harlow. “But for the first time, I think we are 
really getting a glimpse of how and why.” 


BEYOND ENTANGLEMENT 

That prospect seems to be enticing for the 
Simons Foundation, a philanthropic organiza- 
tion in New York City that announced in August 
that it would provide US$2.5 million per year 
for at least 4 years to help researchers to move 
forward on the gravity-quantum information 
connection. “Information theory provides a 
powerful way to structure our thinking about 
fundamental physics,” says Patrick Hayden, 
the Stanford physicist who is directing the pro- 
gramme. He adds that the Simons sponsorship 
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will support 16 main researchers at 14 institu- 
tions worldwide, along with students, postdocs 
and a series of workshops and schools. Ulti- 
mately, one major goal is to build up a compre- 
hensive dictionary for translating geometric 
concepts into quantum language, and vice versa. 
This will hopefully help physicists to find their 
way to the complete theory of quantum gravity. 
Still, researchers face several challenges. 
One is that the bulk-boundary correspond- 
ence does not apply in our Universe, which 
is neither static nor bounded; it is expanding 
and apparently infinite. Most researchers in the 
field do think that calculations using Malda- 
cena’s correspondence are telling them some- 
thing true about the real Universe, but there is 
little agreement as yet on exactly how to trans- 
late results from one regime to the other. 
Another challenge is that the standard 
definition of entanglement refers to particles 
only at a given moment. A complete theory of 
quantum gravity will have to add time to that 
picture. “Entanglement is a big piece of the 
story, but it’s not the whole story,” says Susskind. 
He thinks physicists may have to embrace 
another concept from quantum information 
theory: computational complexity, the number 
of logical steps, or operations, needed to con- 
struct the quantum state of a system. A system 
with low complexity is analogous to a quantum 
computer with almost all the qubits on zero: it is 
easy to define and to build. One with high com- 
plexity is analogous to a set of qubits encoding a 
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number that would take aeons to compute. 

Susskind began to think about computa- 
tional complexity about a decade ago, when 
he noticed that a solution to Einstein’s equa- 
tions of general relativity allowed a wormhole 
in AdS space to get longer and longer as time 
went on. What did that correspond to on the 
boundary, he wondered? What was changing 
there? Susskind knew that it couldn't be entan- 
glement, because the correlations that produce 
entanglement between different particles on 
the boundary reach their maximum in less 
than a second”. In an article last year'’, how- 
ever, he and Douglas Stanford, now at the Insti- 
tute for Advanced Study, showed that as time 
progressed, the quantum state on the bound- 
ary would vary in exactly the way expected 
from computational complexity. 

“Tt appears more and more that the growth of 
the interior ofa black hole is exactly the growth 
of computational complexity,’ says Susskind. If 
quantum entanglement knits together pieces of 
space, he says, then computational complexity 
may drive the growth of space — and thus bring 
in the elusive element of time. One potential 
consequence, which he is just beginning to 
explore, could be a link between the growth of 
computational complexity and the expansion 
of the Universe. Another is that, because the 
insides of black holes are the very regions where 
uantum gravity is thought to dominate, com- 
putational complexity may have a key role ina 
complete theory of quantum gravity. 

Despite the remaining challenges, there is a 
sense among the practitioners of this field that 
they have begun to glimpse something real and 
very important. “I didn't know what space was 
made of before,’ says Swingle. “It wasnt clear 
that question even had meaning.” But now, 
he says, it is becoming increasingly apparent 
that the question does make sense. “And the 
answer is something that we understand,’ says 
Swingle. “It’s made of entanglement.” 

As for Van Raamsdonk, he has written some 
20 papers on quantum entanglement since 
2009. All of them, he says, have been accepted 
for publication. m 
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Ron Cowen is a freelance writer based in 
Silver Spring, Maryland. 
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The central route of China’s South-to-North Water Diversion project runs through Jiaozuo in Henan province. 


Transfer project cannot 
meet China’s water needs 


Better local water management is the way to keep pace with escalating demands, 
not pumping water across the country, warn Jon Barnett and colleagues. 


Imost one year ago, Beijing began 
A& receive water channelled by the 

South-to-North Water Diversion 
(SNWD) project. The biggest inter-basin 
transfer scheme in the world, the SNWD 
project has the capacity to deliver 25 billion 
cubic metres of fresh water per year from the 
Yangtze River in China's south to the drier 
north by two routes — each of which cov- 
ers a distance of more than 1,000 kilometres. 
The project connects four major river basins, 
three megacities, six provinces and hundreds 
of millions of water users and polluters. 


Its success is already in question. Reservoir 
and canal construction costs have report- 
edly reached US$80 billion, and more than 
300,000 people have been displaced’. Pollu- 
tion and environmental fallout, as well as high 
maintenance costs and water prices, make the 
project unsustainable both ecologically and 
socially. And the transfer of water does not 
address the underlying causes of water short- 
ages in the north, namely pollution and inef- 
ficient agricultural, industrial and urban use 
— the effects of which we have been studying 
over the past decade. 


North China could be self-sufficient in 
water without the transfer of water from 
the south. But the necessary steps — among 
them, improving local pollution monitoring 
and building better irrigation infrastructure 
— are inadequately implemented. 

Increasing supply is viewed as the main 
solution to water scarcity because of the con- 
flicting roles of the Chinese government as 
both entrepreneur and regulator. Incentives 
for economic growth in China still outweigh 
incentives for pollution control and limits 
on water extraction, despite ever stricter 
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> environmental laws. Many industries, such 
as the country’s huge hydropower sector, 
profit from expensive solutions to boosting 
water supplies. 

China's water system needs an overhaul. 
Institutional reforms must divorce profit 
motives from regulatory functions; data and 
decisions must be disclosed to the public; 
and the influence of the hydropower sector 
on water-resource management needs to 
be restricted. The volume of water being 
diverted along existing routes of the SNWD 
project must be reduced and extensions 
to the project must be shelved. 

Better local management of resources is 
the only way to bring secure and sustainable 
water to all parts of China. 


BIG WATER 

China's history of grand water-engineering 
projects is almost as old as the nation itself, 
and is inextricably knit with the country’s pol- 
itics, development and self-image. The first 
dam was built in around 600 Bc at Anfeng 
Tang in eastern China. It created a still exist- 
ing reservoir 100 kilometres in circumference 
that could irrigate an area of 24,000 square 
kilometres. Ever since, most of China's water- 
management systems have been created and 
run by the state. 

The SNWD project transports water in two 
ways (see ‘South-to-north water transfer’). Its 
eastern route has the capacity to supply up to 
14.8 billion cubic metres of water per year to 
the provinces of Jiangsu, Anhui, Shandong 
and Hebei, and to the city of Tianjin. The 
water travels through a system of pumps, riv- 
ers, lakes, reservoirs and canals that includes 
the Grand Canal, which was built around 
500 Bc. Its central route will provide up to 
9.5 billion cubic metres of water per year, 
including one-third of Beijing’s water, from 
the Danjiangkou reservoir on the Han River 
(a tributary of the Yangtze). During the con- 
struction of this route, the water level of the 
reservoir was raised by 13 metres, which 
resulted in the resettlement of 180,000 peo- 
ple from Shiyan city and 160,000 from 
Nanyang city. 

A third, western, route is planned that 
would divert up to 20 billion cubic metres 
of water from three tributaries of the upper 
Yangtze through tunnels to the upper reaches 
of the Yellow River. Its path is under debate 
and there has been no commitment to com- 
mencing its construction nor any indication 
of when a decision might be made’. In our 
view, the scope for improving water manage- 
ment makes this extra route unnecessary. 

Without question, northern China, which 
includes the Hai, Huai and Yellow river basins, 
is short of water. The region’s annual per 
capita water availability is only around half of 
the international threshold for water stress”. 
Water scarcity is most acute in the Hai basin, 
where Beijing is located. Farms and cities 
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d China has the capacity to pump 25 billion cubic 
of fresh water per year over a distance of more than 


ities in the drier north. 


WESTERN ROUTE 
Yet to be built but would 
divert up to 20 billion 
cubic metres of water 
per year from tributaries 
of the Yangtze River to 
the Yellow River. 


ae Existing 


have increasingly drawn on groundwater 
such that 50% of aquifers in the North China 
Plain are now below sea level. This scarcity is 
compounded by poor water quality in up to 
60% of water in the rivers of the north, which 
further reduces the supply of clean water for 
drinking and domestic use’. 

The problem is more a scarcity of manage- 
ment than of natural water’. Inefficient agri- 
cultural production consumes about 75% of 
the region’s water and is growing rapidly. In 
areas where cereal crops are flood irrigated, 
losses can exceed 50%. In addition, the lack of 
storage systems along the Yellow River means 
that farmers must use water when it is made 
available — not necessarily when they need it’. 

The idea of water scarcity in the north is 
perpetuated by China’s government for sev- 
eral reasons. It justifies taking water from 
the south to achieve President Xi Jinping’s 
ambitions for a mega-economic region that 
encompasses Beijing, Tianjin and Hebei. And 
it serves the interests of those in the business 
of supplying water, including China’s huge, 
state-owned water-engineering firms. 

But the SNWD project does not ensure a 
reliable supply to the north. Pollution is a per- 
vasive risk. In response to complaints about 
quality from provinces receiving water, the 
National Development and Reform Com- 
mission ordered changes in land use across 
the Danjiangkou reservoir catchment area 
to reduce urban and agricultural run-off. 
Development has been prohibited in some 
areas, and in others communities have been 
resettled. The use of pesticides and fertilizers 
has been limited and industry is subjected to 
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ometres — from the Yangtze River in China’s south to 


CENTRAL ROUTE 
Provides up to 9.5 billion 
cubic metres of water per 
year, including one-third 
of Beijing’s water. 


___| RESERVOIR 


EASTERN ROUTE 
Supplies up to 14.8 billion 
cubic metres of water per year 
to the provinces of Jiangsu, 
Anhui, Shandong and Hebei 
and the city of Tianjin. 


DANJIANGKOU 


) More than 300,000 people 
have been resettled owing 
__| to rising waters, and to 

| lessen pollution. 


South China 
Sea 


stricter pollution controls. In 2015, Danjiang- 
kou reservoir won a national award for water 
quality — at the cost of the impoverishment 
of the hundreds of thousands of people who 
were forced to move. 

And there are other costs. Wang Mengshu, 
acivil engineer at Beijing Jiaotong University, 
has suggested that the expense of maintain- 
ing the SNWD conduits was vastly under- 
estimated. The price of transferred water will 
be too expensive for farmers, who will there- 
fore continue to exploit groundwater’. 

The SNWD project also poses risks in 
source areas. Claims of abundant water in 
the Yangtze hide the fact that shortages do 
occur. In the past decade, there have been 
two severe droughts in the Yangtze basin. And 
periods of water scarcity are more likely in the 
future because of an increase in the number 
of withdrawals and dams, as well as the effects 
of urbanization and climate change. The tim- 
ing of water transfers is therefore important: 
should extractions from the Yangtze occur at 
times of low flow, saline waters from its estu- 
ary could be drawn in. Nearby Shanghai's 
population of 24 million would then face 
critical water shortages until discharge levels 
rose again’. 

Governance of the SNWD project 
remains unresolved. Both the state-owned 
HydroChina Corporation and the central 
government's SNWD project construction 
committee seek to control the flow of the 
project’s waters, even though this is a core 
responsibility of the Chinese Ministry of 
Water Resources. Corporatization of the state 
is reaching into the management of water, 
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Hundreds of thousands of residents were relocated during construction of the project. 


creating tension between motives and profit, 
uncertainty about roles and responsibilities 
and impediments to coordinated manage- 
ment of the nation’s water courses. 


BEYOND PIPES 

In cities, rainwater harvesting and wastewater 
recycling can meet much of the demand. 
According to Qiu Baoxing, a former vice- 
minister of the Chinese Ministry of Hous- 
ing and Urban-Rural Development, the 
SNWD project could have been avoided if 
one-third of buildings in Beijing collected 
rainwater. Increased investment in treat- 
ment systems, efficient irrigation and the 
monitoring and enforcement of pollution 
levels can also improve the supply of usable 
water~*”. Lower-quality water could be used 
for urban landscaping and industry, and some 
water-intensive activities could be moved to 
the south. 

Such solutions require coordination with 
local governments, which are driven by 
growth and profit. When bureaucrats behave 
like businessmen and state-owned enterprises 
operate like private corporations, even strong 
environmental laws have little effect®. By con- 
trast, the SNWD project is easy to adminis- 
trate, politically feasible and drives growth. 

There are signs of change. Since 2006, envi- 
ronmental targets have been included in per- 
formance criteria for local leaders°. And there 
have been experiments in increasing disclo- 
sure to the public of data on the environmen- 
tal performance of firms’. Both measures 
have made local governments and businesses 


more accountable for environmental stand- 
ards. They have led to lower levels of pollution 
and encouraged investment in cleaner tech- 
nologies and the closure of inefficient plants. 
But the enforcement of standards and laws 
varies from region to region — the economic 
imperative still dominates in less-developed 
regions — and data can be falsified’. 

The Chinese government's authority rests 
on maintaining social stability and economic 
growth. The government must therefore 
respond to challenges such as corruption, 
public-health issues and inequality®. Given 


improved living standards, greater levels of 


education and the 

proliferation of 

social media, high 

levels of pollution 

can no longer be 

ignored. Both the 

central and local 

governments in China must be seen to be 
controlling pollution, which can lead to 
secrecy and misinformation. In our experi- 
ence, detailed data on the flow of water and 
pollution levels in major rivers can be difficult 
to obtain and must often be paid for. 


SMALL STEPS 

As is already done for air pollution, the central 
and local Chinese governments should dis- 
close information on water to demonstrate 
the responsible management of resources 
to the public. Providing accessible infor- 
mation about the allocation of water rights 
— as well as the allocation of water itself to 


provinces, irrigation districts and farmers — 
would increase public trust in the system and 
improve the accountability of water manag- 
ers, local government and firms®. 

Local environmental-protection bureaus 
should be given the autonomy and resources 
to collect and analyse monitoring data inde- 
pendently and to enforce pollution standards. 
Exporters that rely on foreign investment 
must increasingly comply with standards 
and regulations as their parent companies 
and consumers demand proof of environ- 
mental responsibility. Industrial water users 
should consider cleaner production as a 
path to savings, new markets and improved 
competitiveness’. 

In agriculture, losses can be reduced by 
lining irrigation canals with concrete. Water 
should be supplied only at times when irri- 
gation is necessary’. The rotation of wheat 
with higher-value crops that take less water 
to grow, such as peanuts, will also improve the 
efficiency of water use”®. 

Investment in new technologies is needed, 
including systems to separate urban water 
according to quality, recycle waste water, 
encourage water conservation and improve 
the harvesting of rainwater. This would 
require performance targets to be set for 
local managers, as well as investment in and 
incentives for building smaller-scale water 
infrastructure. Campaigns to increase pub- 
lic awareness of water issues should also be 
implemented. 

Constraining the influence of the hydro- 
power sector on water-resource management 
will help to shift public investment towards 
these smaller-scale technologies. The sector 
is already expanding into overseas markets to 
compensate for reduced domestic demand in 
the wake of disquiet about water pollution. 

As its limitations become clear, the SNWD 
project might well mark the nadir of big-engi- 
neering solutions to China's water problems. m 
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Marcel Grossmann (left) and Michele Besso (right), university friends of Albert Einstein (centre), both made important contributions to general relativity. 


% 


titi, 


Einstein was no lone genius 


Lesser-known and junior colleagues helped the great physicist to piece together his 
general theory of relativity, explain Michel Janssen and Jiirgen Renn. 


century ago, in November 1915, 
Aut Einstein published his 

general theory of relativity in four 
short papers in the proceedings of the Prus- 
sian Academy of Sciences in Berlin’. The 
landmark theory is often presented as the 
work ofa lone genius. In fact, the physicist 
received a great deal of help from friends 
and colleagues, most of whom never rose 
to prominence and have been forgotten”. 
(For full reference details of all Einstein texts 
mentioned in this piece, see Supplementary 
Information; go.nature.com/ufcgp9.) 

Here we tell the story of how their 
insights were woven into the final version 
of the theory. Two friends from Einstein’s 
student days — Marcel Grossmann and 
Michele Besso — were particularly impor- 
tant. Grossmann was a gifted mathemati- 
cian and organized student who helped 
the more visionary and fanciful Einstein at 
crucial moments. Besso was an engineer, 
imaginative and somewhat disorganized, 
and a caring and lifelong friend to Einstein. 
A cast of others contributed too. 

Einstein met Grossmann and Besso at 
the Swiss Federal Polytechnical School in 
Zurich® — later renamed the Swiss Federal 
Institute of Technology (Eidgenéssische 
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Technische Hochschule; ETH) — where, 
between 1896 and 1900, he studied to 
become a school teacher in physics and 
mathematics. Einstein also met his future 
wife at the ETH, classmate Mileva Marié. 
Legend has it that Einstein often skipped 
class and relied on Grossmann’s notes to 
pass exams. 

Grossmann’s father helped Einstein to 
secure a position at the patent office in 
Berne in 1902, where Besso joined him 
two years later. Discussions between Besso 
and Einstein earned the former the sole 
acknowledgment in the most famous of 
Einstein’s 1905 papers, the one introduc- 
ing the special theory of relativity. As well 
as publishing the papers that made 1905 
his annus mirabilis, Einstein completed 
his dissertation that year to earn a PhD in 
physics from the University of Zurich. 

In 1907, while still at the patent office, he 
started to think about extending the prin- 
ciple of relativity from uniform to arbitrary 
motion through a new theory of gravity. 
Presciently, Einstein wrote to his friend 
Conrad Habicht — whom he knew from a 
reading group in Berne mockingly called the 
Olympia Academy by its three members — 
saying that he hoped that this new theory 


would account for a discrepancy of about 
43” (seconds of arc) per century between 
Newtonian predictions and observations 
of the motion of Mercury’s perihelion, the 
point of its orbit closest to the Sun. 

Einstein started to work in earnest on this 
new theory only after he left the patent office 
in 1909, to take up professorships first at the 
University of Zurich and two years later at 
the Charles University in Prague. He realized 
that gravity must be incorporated into the 
structure of space-time, such that a particle 
subject to no other force would follow the 
straightest possible trajectory through a 
curved space-time. 

In 1912, Einstein returned to Zurich and 
was reunited with Grossmann at the ETH. 
The pair joined forces to generate a fully 
fledged theory. The relevant mathematics 
was Gauss’s theory of curved surfaces, which 
Einstein probably learned from Grossmann’s 
notes. As we know from recollected conver- 
sations, Einstein told Grossmann’: “You 
must help me, or else I'll go crazy.” 

Their collaboration, recorded in 
Einstein’s ‘Zurich notebook, resulted in a 
joint paper published in June 1913, known 
as the Entwurf (‘outline’) paper. The 
main advance between this 1913 Entwurf 
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theory and the general relativity theory 
of November 1915 are the field equa- 
tions, which determine how matter curves 
space-time. The final field equations are 
‘generally covariant’: they retain their form 
no matter what system of coordinates is 
chosen to express them. The covariance of 
the Entwurf field equations, by contrast, 
was severely limited. 


TWO THEORIES 

In May 1913, as he and Grossmann put the 
finishing touches to their Entwurf paper, 
Einstein was asked to lecture at the annual 
meeting of the Society of German Natural 
Scientists and Physicians to be held that Sep- 
tember in Vienna, an invitation that reflects 
the high esteem in which the 34-year-old 
was held by his peers. 

In July 1913, Max Planck and Walther 
Nernst, two leading physicists from Berlin, 
came to Zurich to offer Einstein a well-paid 
and teaching-free position at the Prussian 
Academy of Sciences in Berlin, which he 
swiftly accepted and took up in March 1914. 
Gravity was nota pressing problem for Planck 
and Nernst; they were mainly interested in 
what Einstein could do for quantum physics. 

Several new theories had been proposed 
in which gravity, like electromagnetism, was 
represented by a field in the flat space-time 
of special relativity. A particularly promising 
one came from the young Finnish physicist 
Gunnar Nordstrom. In his Vienna lecture, 
Einstein compared his own Entwurf theory 
to Nordstrém’s theory. Einstein worked on 
both theories between May and late August 
1913, when he submitted the text of his lec- 
ture for publication in the proceedings of the 
1913 Vienna meeting. 

In the summer of 1913, Nordstrém visited 
Einstein in Zurich. Einstein convinced him 
that the source of the gravitational field in 
both their theories should be constructed 
out of the ‘energy-momentum tensor’: in 
pre-relativistic theories, the density and the 
flow of energy and momentum were repre- 
sented by separate quantities; in relativity 
theory, they are combined into one quantity 
with ten different components. 

This energy-momentum tensor made its 
first appearance in 1907-8 in the special- 
relativistic reformulation of the theory of 
electrodynamics of James Clerk Maxwell 
and Hendrik Antoon Lorentz by Hermann 
Minkowski. It soon became clear that an 
energy-momentum tensor could be defined 
for physical systems other than electromag- 
netic fields. The tensor took centre stage in 
the new relativistic mechanics presented in 
the first textbook on special relativity, Das 
Relativitdtsprinzip, written by Max Laue in 
1911. In 1912, a young Viennese physicist, 
Friedrich Kottler, generalized Laue’s formal- 
ism from flat to curved space-time. Einstein 
and Grossmann relied on this generalization 


in their formulation of the Entwurf theory. 
During his Vienna lecture, Einstein called 
for Kottler to stand up and be recognized for 
this work’. 

Einstein also worked with Besso that 
summer to investigate whether the Entwurf 
theory could account for the missing 43” per 
century for Mercury’s perihelion. Unfortu- 
nately, they found that it could only explain 
18”. Nordstrém’s theory, Besso checked later, 
gave 7" in the wrong direction. These calcu- 
lations are preserved in the ‘Einstein—Besso 
manuscript of 1913. 

Besso contributed significantly to the 
calculations and raised interesting ques- 
tions. He wondered, for instance, whether 
the Entwurf field equations have an unam- 
biguous solution that uniquely determines 
the gravitational field of the Sun. Historical 
analysis of extant manuscripts suggests that 
this query gave Einstein the idea for an argu- 
ment that reconciled him with the restricted 
covariance of the Entwurf equations. This 
‘hole argument’ seemed to show that gen- 
erally covariant field equations cannot 
uniquely determine the gravitational field 
and are therefore inadmissible’. 

Einstein and Besso also checked whether 
the Entwurf equations hold in a rotating 
coordinate system. In that case the inertial 
forces of rotation, such as the centrifugal 
force we experience on a merry-go-round, 
can be interpreted as gravitational forces. 
The theory seemed to pass this test. In 
August 1913, however, Besso warned him 
that it did not. Einstein did not heed the 
warning, which would come back to haunt 
him. 

In his lecture in Vienna in September 
1913, Einstein concluded his comparison of 
the two theories with a call for experiment 
to decide. The Entwurf theory predicts 
that gravity bends 


light, whereas Nor- “Worried that 
dstrém’s does not. It Hilbert might 
would take another beat him to 
five years to find out. the punch, 
Erwin Finlay Freun- Einstein 

dlich, ajuniorastron- yushednew 
omer in Berlin with equations into 
whom Einstein had print. ” 


been in touch since 

his days in Prague, travelled to Crimea for 
the solar eclipse of August 1914 to determine 
whether gravity bends light but was interned 
by the Russians just as the First World War 
broke out. Finally, in 1919, English astrono- 
mer Arthur Eddington confirmed Einstein's 
prediction of light bending by observing the 
deflection of distant stars seen close to the 
Sun's edge during another eclipse, making 
Einstein a household name”. 

Back in Zurich, after the Vienna lecture, 
Einstein teamed up with another young 
physicist, Adriaan Fokker, a student of 
Lorentz, to reformulate the Nordstr6m 


theory using the same kind of mathematics 
that he and Grossmann had used to for- 
mulate the Entwurf theory. Einstein and 
Fokker showed that in both theories the 
gravitational field can be incorporated into 
the structure of a curved space-time. This 
work also gave Einstein a clearer picture of 
the structure of the Entwurf theory, which 
helped him and Grossmann in a second 
joint paper on the theory. By the time it 
was published in May 1914, Einstein had 
left for Berlin. 


THE BREAKTHROUGH 

Turmoil erupted soon after the move. 
Einstein’s marriage fell apart and Mileva 
moved back to Zurich with their two 
young sons. Albert renewed the affair 
he had started and broken off two years 
before with his cousin Elsa Lowenthal (née 
Einstein). The First World War began. Ber- 
lin’s scientific elite showed no interest in 
the Entwurf theory, although renowned 
colleagues elsewhere did, such as Lorentz 
and Paul Ehrenfest in Leiden, the Nether- 
lands. Einstein soldiered on. 

By the end of 1914, his confidence had 
grown enough to write a long exposition 
of the theory. But in the summer of 1915, 
after a series of his lectures in Gottingen had 
piqued the interest of the great mathemati- 
cian David Hilbert, Einstein started to have 
serious doubts. He discovered to his dismay 
that the Entwurf theory does not make 
rotational motion relative. Besso was right. 
Einstein wrote to Freundlich for help: his 
“mind was in a deep rut’, so he hoped that 
the young astronomer as “a fellow human 
being with unspoiled brain matter” could tell 
him what he was doing wrong. Freundlich 
could not help him. 

The problem, Einstein soon realized, lay 
with the Entwurf field equations. Worried 
that Hilbert might beat him to the punch, 
Einstein rushed new equations into print in 
early November 1915, modifying them the 
following week and again two weeks later in 
subsequent papers submitted to the Prussian 
Academy. The field equations were generally 
covariant at last. 

In the first November paper, Einstein 
wrote that the theory was “a real triumph” 
of the mathematics of Carl Friedrich Gauss 
and Bernhard Riemann. He recalled in this 
paper that he and Grossmann had con- 
sidered the same equations before, and 
suggested that if only they had allowed 
themselves to be guided by pure mathemat- 
ics rather than physics, they would never 
have accepted equations of limited covari- 
ance in the first place. 

Other passages in the first November 
paper, however, as well as his other papers 
and correspondence in 1913-15, tell a dif- 
ferent story. It was thanks to the elabora- 
tion of the Entwurf theory, with the help of 
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ETH Zurich, where Einstein met friends with whom he worked on general relativity. 


Grossmann, Besso, Nordstrém and Fokker, 
that Einstein saw how to solve the problems 
with the physical interpretation of these 
equations that had previously defeated him. 

In setting out the generally covariant field 
equations in the second and fourth papers, 
he made no mention of the hole argument. 
Only when Besso and Ehrenfest pressed 
him a few weeks after the final paper, dated 
25 November, did Einstein find a way out 
of this bind — by realizing that only coinci- 
dent events and not coordinates have physi- 
cal meaning. Besso had suggested a similar 
escape two years earlier, which Einstein had 
brusquely rejected’. 

In his third November paper, Einstein 
returned to the perihelion motion of 
Mercury. Inserting the astronomical data 
supplied by Freundlich into the formula 
he derived using his new theory, Einstein 
arrived at the result of 43” per century and 
could thus fully account for the difference 
between Newtonian theory and observa- 
tion. “Congratulations on conquering the 
perihelion motion,’ Hilbert wrote to him 
on 19 November. “IfI could calculate as fast 
as you can,’ he quipped, “the hydrogen atom 
would have to bring a note from home to be 
excused for not radiating” 

Einstein kept quiet on why he had been 
able to do the calculations so fast. They were 
minor variations on the ones he had done 
with Besso in 1913. He probably enjoyed 
giving Hilbert a taste of his own medicine: 
ina letter to Ehrenfest written in May 1916, 
Einstein characterized Hilbert’s style as 
“creating the impression of being super- 
human by obfuscating one’s methods”. 

Einstein emphasized that his general 
theory of relativity built on the work 
of Gauss and Riemann, giants of the 


mathematical world. But it also built on 
the work of towering figures in physics, 
such as Maxwell and Lorentz, and on the 
work of researchers of lesser stature, nota- 
bly Grossmann, Besso, Freundlich, Kottler, 
Nordstr6m and Fokker. As with many 
other major breakthroughs in the history 
of science, Einstein was standing on the 
shoulders of many scientists, not just the 
proverbial giants*. = 
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CORRECTION 

The Comment article ‘Einstein was no 
lone genius’ (M. Janssen and J. Renn 
Nature 527, 298-300; 2015) wrongly 
stated the dates during which Albert 
Einstein studied at the Swiss Federal 
Polytechnical School in Zurich. He was 
there between 1896 and 1900. 
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Charles L. Dodgson, better known as Lewis Carroll, in a self-portrait from the 1880s. 


MATHEMATICS 


Logic and 
Lewis Carroll 


As Alice’s Adventures in Wonderland reaches 150, Francine 
Abeles surveys its creator’s wide-ranging legacy. 
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mathematical lecturer at Christ Church 

College in the University of Oxford 
UK. His job was to prepare Christ Church 
men (for it was all men) to pass examina- 
tions in mathematics. Dodgson (1832-98) 
would go on to publish Alice’s Adventures 
in Wonderland (1865) and Through the 
Looking- Glass (1871) under the pen name 
Lewis Carroll, but he also produced many 
pamphlets and ten books on mathemati- 
cal topics. 

In some of these, he exhibited unusual 
methods — for rapid arithmetic, for exam- 
ple. Others featured innovative ideas that 
foreshadowed developments in the twenti- 
eth century, for instance in voting theory. 
All but two of these books were published by 
Macmillan (until this year, the parent com- 
pany of this journal’s publisher). Macmillan 
co-founder Alexander Macmillan was 
Dodgson’s trusted publisher and friend for 
35 years (see go.nature.com/9q8oqe). 

What unifies Carroll's oeuvre is the wit 
and colour apparent in the manifestations 
of his wide-ranging mathematical interests, 
particularly in geometry and logic. The Alice 
books contain many supreme examples. 
The “Mad Tea-Party’, for instance, has the 
Hare, Hatter, Dormouse and Alice circling 
around static place settings like numbers 
on a circle, as in a modular system, rather 
than ina line. Carroll developed the earliest 
modern use of today’s ‘logic trees, a graphi- 
cal technique for determining the validity 
of complex arguments that he called the 
‘method of trees. This was a step towards 
automated approaches to solving multiple 
connected problems of logic. True to form, 
the puzzles that Carroll solves with his trees 
are given quirky names — “The Problem of 
Grocers on Bicycles’, “The Pigs and Balloons 
Problem”. 

The ten sections of Carroll’s book of 
droll mathematical stories, A Tangled Tale, 
first appeared between 1880 and 1885 asa 
serial in the popular magazine The Monthly 
Packet. Carroll dubbed each part a ‘knot’ 
to signify the difficulty of the one, two or 
three problems it featured. In the follow- 
ing issue of the magazine, he would sum- 
marize the puzzle, solve it and comment on 
the solutions he had received from readers, 
often amusingly presented (see http://www. 
onlinemathlearning.com/tangled-tale. 
html). A Tangled Tale became a favourite 
of Josiah Willard Gibbs (1839-1903), the 
applied mathematician and physical chem- 
ist praised by Albert Einstein as “the greatest 
mind in American history”. 

Carroll believed 
that beyond their 
entertainment value, To purchase an ebook 
mental recrea-_ onLewisCarrollas 
tions such as games __ logic puzzler, see. 
and logic puzzles 


I n 1855, Charles L. Dodgson became the 
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ILLUSTRATION BY JOHN TENNIEL FROM ALICE'S 
ADVENTURES IN WONDERLAND (MACMILLAN, 1995) 


conferred a sense of power on the solver. 
This, he felt, enabled them to analyse any 
subject clearly and, most important, to 
detect and unravel fallacies. In this vein, 
Carroll puns about other knots in Alice’s 
Adventures in Wonderland. In Chapter 3, 
for instance, the Mouse responds to Alice’s 
comments that he had got to the fifth bend 
in his tale (which appears on the page as 
a serpentine, tail-shaped paragraph) by 
crying, “I had not!” Carroll's ever-curious 
adventurer misunderstands amusingly: “A 
knot’ said Alice, always ready to make her- 
self useful, and looking anxiously about her. 
‘Oh, do let me help to undo it!” 

Carroll was, of course, a devotee of word- 
play, as almost any page of Alices Adventures 
and A Tangled Tale reveals. A fan of acros- 
tics, Carroll dedicated the latter — published 
in book form in 1885 — to his friend and 
pupil, the 19-year-old Edith Rix, in the form 
of a poem that spells her name out in the 
second letter of each line: 


Beloved Pupil! Tamed by Thee, 
Addish=, Subtrac=, Multiplica=tion, 
Division, Fractions, Rule of Three, 
Attest thy deft manipulation! 


Then onward! Let the voice of fame 
From Age to Age repeat thy story, 
Till thy hast won thyself a name 
Exceeding even Euclid’ glory! 


In the last decades of his life, Carroll 
published three mathematical pieces in 
Nature. The first, on a method for finding 
the day of the week for any date (L. Car- 


roll Nature 35, 517; 1887), reflects the > 


Alice’s circular conversation with the Caterpillar 
is a gem of semantic wordplay. 
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The Brothers Vonnegut: Science and Fiction in the House of Magic 
Ginger Strand FARRAR, STRAUS AND GIROUX (2015) 

Kurt Vonnegut, beloved troublemaker and science-fiction novelist, 
famously studied chemistry — but it was his brother Bernie who 
shone in the field. In this engrossing cultural history, Ginger Strand 
traces the brothers’ intellectual development during the Second 
World War and its chill aftermath. Military interest led Bernie to 
research silver iodide as a trigger for cloud seeding at General Electric, 
and Kurt's horrifying experiences in combat inspired his inimitable 
fiction. Strand shows how both men, by calling in different ways for 
progress to be decoupled from conflict, revealed a rare integrity. 


Thunder & Lightning: Weather Past, Present, Future 

Lauren Redniss RANDOM House (2015) 

Writer and artist Lauren Redniss’s Radioactive (It Books, 2010; see 

G. Frazzetto Nature 469, 29; 2011) was a beautiful tour de force, 
meshing superb illustrations with an original telling of the lives of 
Marie and Pierre Curie. Now, in another aesthetically charged and 
deeply researched account, Redniss takes on meteorology. Here are 
phenomena from fog to cyclones; cloud types (a series of nebulous 
‘portraits’); the sensory appreciation of weather, such as Benjamin 
Franklin’s air bathing, or snowfall’s “muffled quietude” — and more. A 
wild rainstorm of a book, pelting the reader with ideas and inspiration. 
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Fallen Glory: The Lives and Deaths of Twenty Lost Buildings from 
the Tower of Babel to the Twin Towers 

James Crawford OLD STREET (2015) 

This multiple biography of vanished monoliths is itself monolithic, 
wending its way from Iraq to Manhattan and beyond. Standouts 

in the narratives built by writer James Crawford include the Tower 
of Babel, looming in manifestations from Mesopotamian Emperor 
Nebuchadnezzar’s vast ziggurat Etemenanki to Pieter Bruegel the 
Elder’s exquisite and disturbing 1563 painting. The Bastille, Roman 
Forum, Berlin Wall — all eloquently fall, along with New York’s Twin 
Towers in the hideous events of 2001. My only quibble? No index. 


The Orange Trees of Marrakesh: Ibn Khaldun and the Science of 
Man 

Stephen Frederic Dale HARVARD UNIVERSITY PRESS (2015) 

Six centuries ago, a Tunisian scholar created a new mirror for 
humankind. In his masterwork Mugaddimah, lbn Khaldun 
(1332-1406) became the first person to approach history 


tT 
HE ORANGE Thbas 
of MARRAKESH 


scientifically, by analysing social, economic and political evidence to 
| Sciewed the ; reveal cycles of societal change. In this sober study, historian Stephen 
i ag Frederic Dale argues that Ibn Khaldun’s work is a key milestone on the 
= - road from Greek to Enlightenment thought, chiming with the radical 


reasoning of philosophers such as Montesquieu and Adam Smith. 


Hamburgers in Paradise: The Stories Behind the Food We Eat 
Louise O. Fresco (transl. Liz Waters) PRINCETON UNIVERSITY PRESS (2015) 
Behind the whimsical title is a serious cultural history of food, 

newly translated from Dutch. Plant scientist Louise Fresco, a former 
assistant director-general of the Food and Agriculture Organization 
of the United Nations, argues that the trope of paradise as effortless 
abundance permeates humanity’s tortured relationship with the 
edible. Her comprehensive trawl through biotechnology, supply 
chains and more concludes that — given more research and effort 
—a real paradise of plenty is within reach. Barbara Kiser 
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> calendar problems of the time: to obtain 
information on future days and dates, you 
had to consult an almanac. Carroll found 
mental calculation methods gripping. 
Introducing the piece, he wrote, “I am not 
a rapid computer myself”, yet noted that he 
could do ten such 


problems in less  “Coprpjj 
than four minutes. G-egted a vivid 
His rule uses four 
integer calcula- oaieiatd of . 
tions: two for the OM" presaging 
year, the third for Zi per ki s 
themonthandthe . ah opments " 
last for the day. in the twentiet 
Inanerabefore Comtury and 
calculators, stand- beyond. 


ard arithmetic 

processes were onerous and prone to error. 
Carroll (writing this time under his real 
name) summarized his work on simplify- 
ing ordinary arithmetical calculations in 
his second piece in Nature, ‘Brief Method 
of Dividing a Given Number by 9 or 11’ 


(C. L. Dodgson Nature 56, 565-566; 1897) 
which also included division by 13, 17, 19 
and 41, as well as by numbers within 10 
from a power of 10, either way. The third 
piece, ‘Abridged Long Division’ (C. L. Dodg- 
son Nature 57, 269-271; 1898), by his own 
admission, uses ideas put forth by others that 
he improved on, particularly an accuracy test. 
However, this paper has implications for 
modern computing in its emphasis on mini- 
mizing the number of steps in an algorithm. 

Carroll did not influence his contem- 
porary colleagues in the development of 
mathematical ideas. However, posthu- 
mously, beginning in the last half of the 
twentieth century, his contributions to vot- 
ing theory were uncovered in three papers 
written between 1874 and 1876. The third, 
‘A Method of Taking Votes on More Than 
Two Issues; is the most important. Carroll 
was the first to create a voting method that 
would achieve biproportional representa- 
tion — that is, proportionality with respect 
both to the population in the districts and to 
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A love of puzzles is clear in the call to behead the bodyless Cheshire Cat: what, exactly, would you behead? 
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Carroll’s publisher, Alexander Macmillan. 


the apportionment of seats to the political 
parties in the legislature. Despite Carroll’s 
friendship with Lord Salisbury, the UK 
prime minister at the time, it was not applied 
for political reasons. (Today, the European 
Parliament uses a form of proportional 
representation.) 

Carroll’s work in logic, notably the 
unpublished second part of his book 
Symbolic Logic, foreshadowed results that 
appeared about 100 years later. This long- 
lost section, which contains the method 
of trees, was described by philosopher 
W. W. Bartley (Sci. Am. 227, 38-46; 1972). 
Carroll’s book on linear algebra (An Ele- 
mentary Treatise on Determinants with 
Their Application to Simultaneous Linear 
Equations and Algebraic Geometry, 1867) 
is also groundbreaking. His ‘condensa- 
tion’ method for computing determinants 
sparked research that led to a formulation 
of the alternating sign matrix conjecture by 
David Robbins and Howard Rumsey in the 
1980s. And his 1895 “What the Tortoise Said 
to Achilles; a logic problem he published in 
the philosophical journal Mind, remains 
unsolved. In 1858, he was the first to create 
a cypher in matrix form based on a non- 
standard (modular) arithmetic; it was pub- 
lished more than 100 years later. 

Asa mathematician, logician, writer and 
innovative photographer, Carroll created 
a vivid tapestry of work, knotted, twisted 
and multistranded, and presaging in many 
ways developments in the twentieth century 
and beyond. Yet for all the complexity and 
playfulness of this master gamester’s body 
of work — from voting theory to his great 
creation, Alice, on her long, strange journeys 
towards identity and maturity — his under- 
lying concerns were fairness, certainty and 
truth. = 
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Peat fires: emissions 
likely to worsen 


The horrific haze from Indonesia's 
forest and peatland fires, started 
deliberately to clear land for 
planting and made worse by 
drought, has become a global 
crisis. Indonesia's government 
could stop this annual catastrophe, 
but it so far seems to lack the 
political will to do so. 

In the past decade, Indonesia 
has destroyed its forests faster 
than any other nation (see 
go.nature.com/b9rhxz). By one 
estimate, daily carbon emissions 
from its forest and peatland fires 
now exceed those from the entire 
US economy (see go.nature.com/ 
hpworu). 

The situation is likely to 
worsen: Indonesia and Malaysia 
are planning to set up a Council 
of Palm Oil Producing Countries. 
This intends to force major forest- 
exploiting corporations to relax 
their zero-deforestation pledges 
(see go.nature.com/agvbhn). 
Oil-palm expansion is one of the 
biggest drivers of peatland and 
forest destruction. 

Localized actions and belated 
half-measures by the Indonesian 
government are no longer 
enough. Aided by the global 
community, it must ban fires 
in peatlands and native forests; 
declare a moratorium on clearing 
peatlands; restore water to 
degraded peatlands; and create 
financial incentives for provinces 
to reduce deforestation. 

Susan G. Laurance, William F. 
Laurance James Cook University, 
Cairns, Queensland, Australia. 
susan.laurance@jcu.edu.au 


Peat fires: consumers 
to help beat them out 


Southeast Asia’s choking air 
pollution continues unabated, 
fuelled by the burning of peat 
swamps for agriculture. The issue 
flies in the face of long-standing 
regional agreements on land 
clearance by governments in the 
Association of Southeast Asian 
Nations, and is at last galvanizing 


non-governmental organizations 
(NGOs), banks and businesses 
into action against the companies 
responsible. 

Singapore's punitive 
Transboundary Haze Pollution 
Act has met with some success 
(see, for example, J. H.S. Lee 
et al. Environ. Sci. Policy 55, 
87-95; 2016). The Singapore 
Environment Council, an NGO, 
has suspended environmental 
certification of paper-pulp 
companies that might be 
connected with the fires. This 
has prompted some supermarket 
chains in Singapore to stop 
selling products containing raw 
materials from these companies, 
and banks are reviewing their 
policies for lending to them. 
Suspension could prompt 
companies to become more 
sustainable and to consider 
setting aside undeveloped peat- 
swamp forests for conservation. 

Consumers should back this 
drive for corporate environmental 
accountability by using publicly 
available resources (see, for 
example, www.ethical.org.au) to 
ensure that their product choices 
do not result in peat clearance. 
Lahiru S. Wijedasa National 
University of Singapore; and 
Conservation Links, Singapore. 
Mary Rose C. Posa National 
University of Singapore, Singapore. 
Gopalasamy R. Clements 
University of Malaysia Terengganu, 
Kuala Terengganu, Malaysia. 
lahirux@gmail.com 


Time for Russia to 
tap renewables 


Russia's big territory and coastline 
are potentially huge sources of 
renewable energy from sun, wind, 
waves, tides and currents, but 
about 91% of the country’s energy 
still comes from fossil fuels. 
This must be urgently rectified 
if Russia is to honour its pledge, 
made ahead of this months 
climate summit in Paris, to reduce 
its carbon emissions by 25-30% 
relative to 1990 levels by 2030. 
Russia’s carbon emissions have 
been increasing since 1998. Only 


3.2% of its total primary energy 
supply came from renewables in 
2013 (nuclear accounts for the rest; 
see www.iea.org/statistics). This 
compares poorly with industrial 
nations such as Brazil (40%), 
Sweden (35.7%), India (26.4%), 
Canada (18.6%) China (11%) and 
the United States (6.8%). 

Russian environmental 
legislation is taking small, 
ongoing steps to protect its 
natural resources, clean up 
polluted areas, control air and 
water quality and advance 
green industrial technologies. 
Environmental penalties for 
pollution and illegal use of natural 
resources have increased sharply. 
More investment in renewable 
energy will help to protect Russia’s 
natural environment. 

Alexander Gorobets Sevastopol, 
Russia. 
alex-gorobets@mail.ru 


Cannabis: monitor 
policy changes 


Policy changes that increasingly 
permit the medical and 
recreational use of cannabis 
have important implications for 
society and drug policies overall 
(see Nature 524, 280-283; 2015 
and Nature 525, S1-S18; 2015). 
There is an urgent need to set 
up collaborative international 
monitoring of the effects of these 
changes in different countries to 
achieve a meaningful evaluation 
of their impact. 

Monitoring should include 
differences in arrest numbers, 
imprisonment, public-health 
and social impact, supply and 
economic analyses, as well as 
co-dependence and substitution 
effects on the use of other drugs, 
alcohol and tobacco products (go. 
nature.com/jr5lgt). 

The effects of cannabis policy 
changes might be influenced by, 
for example, a legal age limit, 
accurate labelling of contents 
and potency, restrictions 
on advertising, and law- 
enforcement practices. 

Lucas Wiessing* European 
Monitoring Centre for Drugs and 


Drug Addiction, Lisbon, Portugal. 
lucas.wiessing@emcdda.europa.eu 
*On behalf of 5 correspondents (see 
go.nature.com/tn7ylb for full list). 


Cannabis: debated 
schizophrenia link 


In our view, Matthew Hills 
arguments against a causal 

link between cannabis use and 
schizophrenia fail to clinch this 
debate (Nature 525, S14; 2015). 

His contention that the 
increased societal use of cannabis 
over time is not reflected in 
increased rates of schizophrenia 
has been tested only once to our 
knowledge — and that study 
came to the opposite conclusion 
(J. Boydell et al. Psychol. Med. 

36, 1441-1446; 2006). In 
multifactorial conditions such 

as schizophrenia, an increase in 
one risk factor is not necessarily 
balanced by a decrease in another. 
Deaths from cardiac disease are 
declining in many countries 
despite increased obesity, but 

that does not mean that obesity is 
unrelated to cardiac disease. 

Hill misinterprets our review 
of cannabis use by people with 
psychosis (A. Kolliakou et al. Intl 
J. Dev. Neurosci. 29, 335-346; 
2011). Contrary to his inference 
that this group self-medicates to 
mitigate negative symptoms, we 
found that the most commonly 
reported use by these individuals 
was purely recreational. 

He suggests that cannabis 
is a risk factor only for those 
with a genetic predisposition 
to schizophrenia (but see 
M. Di Forti et al. Biol. Psychiatry 
72, 811-816; 2012). Another 
explanation could be that some 
of the genes associated witha 
proclivity for cannabis smoking 
also show up among those who 
are predisposed to schizophrenia, 
because a genetic tendency for the 
habit could in turn increase the 
risk of schizophrenia. 

Matthew Large University of 
New South Wales, Australia. 
Marta Di Forti, Robin Murray 
Kings College London, UK. 
mmbl@bigpond.com 
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OBITUARY 


Richard Heck 


(1931-2015) 


Organic chemist who won a Nobel for palladium catalysis. 


cc o something with transition 
D metals,” quipped the research 
director of the Hercules Powder 
Company to Richard Heck in 1958, two 
years after he had joined. After consulting 
with Pat Henry, an organometallic chemist 
across the hall, and with some bold thinking, 
Heck discovered a new method for clicking 
together carbon atoms in a single step 
(R. FE. Heck Synlett 18, 2855-2860; 2006). 

The carbon-carbon bond is a prerequi- 
site for life: nature catalyses it with enzymes; 
Heck used palladium. It was the dawn ofa 
new era in organic synthesis, the field dedi- 
cated to constructing a vast array of com- 
pounds, from simple building blocks to 
bewildering mega-atom frameworks. Heck 
had taken the first step on the path that led 
him to the 2010 Nobel Prize in Chemistry. 

Ask an organic chemist today about prod- 
ucts of the Heck reaction and they’ll name 
smartphone displays, sunscreens, perfumes, 
pesticides and medicines. One example is 
the over-the-counter pill naproxen for 
pain, fever, stiffness and inflammation. A 
biologist will recognize the reaction as the 
basis for the coupling of fluorescent dyes 
to DNA bases, allowing the automation of 
DNA sequencing and the elucidation of the 
human genome. 

Richard Heck died on 9 October 2015 in 
Manila. He was born in Springfield, Massa- 
chusetts, on 15 August 1931. Aged eight, he 
moved with his parents, both professional 
dancers, to Los Angeles in California. His 
interest in chemistry was stirred by the vivid 
colours and abundant fragrances of flowers 
in the vacant lot near their home. Following 
a PhD with the prominent physical organic 
chemist Saul Winstein at the University of 
California, Los Angeles, and a postdoctoral 
fellowship with the future Nobel laureate 
Vladimir Prelog at the Swiss Federal Insti- 
tute of Technology in Zurich, the 25-year- 
old Heck joined Hercules (now Ashland) in 
Wilmington, Delaware, in 1956. 

After two years of working on the develop- 
ment of a commercial process for producing 
polyethylene using the newly discovered 
Ziegler-Natta catalysts, Heck was given the 
fateful mission by research director David 
Breslow. “They left us alone to try anything 
we want,’ Heck later said. So, appreciat- 
ing that discovery proceeds stepwise from 
scattered observations in the literature, 
Heck studied the alkene hydroformylation 
reaction. He proposed the first correct 


mechanism for a reaction catalysed by a 
transition metal. It illuminated many other 
unexplained organometallic reactions, 
and produced a rich harvest of new cobalt 
organometallic chemistry. 

The hydroformylation technology is cur- 
rently used to produce 6.8 million tonnes 
of basic carbon building blocks (alcohols 
and aldehydes) each year for the synthesis 
of everyday materials. Perhaps less known 
today are Heck’s other forays into cobalt 
carbonyl chemistry to establish reactions 
with a variety of organic molecules (carbon 
monoxide, alkenes, dienes, epoxides and 
ketones). With, as he put it, “no immediate 
ideas” on how to employ this chemistry prof- 
itably for Hercules, he took a new direction. 

In 1968, the chemical community was 
astounded by Heck’s flurry of seven consecu- 
tive single-author papers in the Journal of the 
American Chemical Society. In hindsight, 
these heralded the innovative work to follow 
when he moved down the road to the Uni- 
versity of Delaware in 1971. The next year, 
Heck’s seminal paper appeared (R. F. Heck 
& J. P. Nolley J. Org. Chem. 37, 2320-2322; 
1972). With characteristic generosity, he 
begins by acknowledging Tsutomu Mizoroki 
for preceding his discovery. He goes on: “We 
have independently discovered this reaction 
and find that it can be carried out under much 
more convenient laboratory conditions.” 

The discovery, now also widely known 
as the Mizoroki—Heck reaction, involves 
the palladium catalyst slipping into a car- 
bon-halogen bond to give a fleeting species. 
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This intermediate clutches another reactive 
molecule to form, after more contortions, a 
product in which a new carbon-carbon bond 
is established. Although this paper led to the 
Nobel prize in 2010 (with Ei-ichi Negishi 
and Akira Suzuki), it lay mostly unappre- 
ciated in the literature. Heck continued on 
his innovative path with two further publi- 
cations, in 1975, which revealed two other 
new ways of carbon-carbon bond formation 
that are harbingers of the Sonogashira and 
Suzuki-Miyaura cross-coupling reactions. 
“I reported the copperless Sonogashira,’ Dick 
Heck said to me unpretentiously. 

During this prodigious run of ground- 
breaking research, he and his co-workers 
established two other mainstays of the syn- 
thetic chemists’ toolbox: palladium-catalysed 
carbonylation of aryl halides and transfer 
hydrogenation with formate as a reductant. 

Today, undergraduates learn the Heck 
reaction in class and laboratory; industrial 
chemists practise it to make tonnes of drugs 
against asthma, diabetes and AIDS, among 
others. Thus, Heck’s work may be considered 
as a forerunner of a cornucopia of transition- 
metal-catalysed technologies that are in 
operation worldwide. 

In 1989, he retired to Florida with his Fili- 
pino wife Socorro Nardo. In 2006, I invited 
him to Queen's University, Canada, to follow 
up on his cobalt work. Students overcame 
their awe to work alongside him, striving 
to arrive before his customary 8 a.m. start. 
Seventeen years after retirement and 45 years 
after opening the door to organocobalt 
chemistry, Dick entered the lab again, pre- 
pared the complexes and measured their 
infrared spectra (“you get all of the informa- 
tion you need”). With the assistance ofa post- 
doc, he obtained nuclear magnetic resonance 
and high-resolution mass spectrometry data, 
interpreted them, and took the next step. 

Dick returned later in 2006 to the Philip- 
pines with Socorro. A handwritten letter to 
me noted that he had returned to his two 
passions: “I have some room to grow orchids 
again so I will have something to do,’ and 
“Have you found anyone to work with cobalt 
carbonyl?” He had drawn chemical structures 
of potential next steps along the palladium- 
paved path that he had established. m 
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Extra dimension for bone analysis 


A combination of two techniques — computed tomography and small-angle X-ray scattering — and serious computing 
power have enabled multi-scale, three-dimensional analysis of bone and tooth tissue. SEE LETTERS P.349 & P.353 


PETER FRATZL 


and by Schaff et al.”) describe differ- 

ent approaches to visualizing three- 
dimensional bone and tooth structures 
on both macroscopic and nanometre 
scales. The methods derive from com- 
puted tomography, a well-established 
imaging technique that can yield 3D 
pictures of bones, but which quantifies 
just one scalar parameter, such as mineral 
density, in each voxel (3D pixel) of the 
image. Using the new techniques, each 
voxel contains information on both the 
orientation and the size of mineral parti- 
cles in the bone or tooth specimen. 

The strength of bone is determined 
by its structure on all scales, particularly 
by local variation in the size, amount 
and orientation of bone mineral parti- 
cles’. Formed from calcium phosphate, 
these mineral platelets are just a few 
nanometres thick and are embedded in 
a matrix of collagen molecules (Fig. 1, 
inset). Because bone tissue is constantly 
remodelled and adapted, the orientation 
of mineralized collagen fibrils varies in a 
complex fashion throughout the tissue’. 

Although transmission electron 
microscopy provides sufficient resolu- 
tion to visualize the mineral particles, it 
does not allow them to be mapped over 
millimetre distances in complete bone 
sections. For this reason, a technique 
called small-angle X-ray scattering 
(SAXS) has been used in a ‘scanning’ 
mode since the 1990s to study bone 
taken from patient biopsies and ani- 
mals”®. In this method, a bone specimen is 
moved in two dimensions (defined by the 
x and y axes in Fig. 1) across a narrow X-ray 
beam, and at each position of this scan a SAXS 
pattern is collected. This allows the simulta- 
neous visualization of two different scales: the 
structure, particularly the orientation, of the 
mineralized collagen fibrils in every pixel of 
the scan; and the variation of fibril orientation 
across a macroscopic specimen. 

When a 2D X-ray detector is used, as 
in the early implementations of scanning 
SAXS, it effectively provides 4D information: 


I n this issue, two papers (by Liebi et al.’ 
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Figure 1 | Principle of small-angle X-ray scattering (SAXS) 
tomography. Two papers’ report methods for SAXS 
tomography that enable multi-scale analysis of bone and tooth 
structures. A sample (here, a bone specimen) is scanned by a 
narrow X-ray beam in the x and y directions for various rotation 
angles g around an axis through the sample. The scans provide 
2D maps that are combined to produce a 3D reconstruction of 
the bone structure. Unlike conventional methods for visualizing 
bones, SAXS tomography provides both a 3D macroscopic map 
of the sample and 3D information on the nanoscale mineral 
particles embedded in the collagen molecules of bone (such as 
their predominant direction of alignment; purple arrow in inset). 


a macroscopic 2D map of the bone tissue and 
2D information about the structure of the 
mineralized collagen fibrils in every pixel of 
the map’. It is straightforward to extend this 
to five dimensions by rotating the bone section 
around an axis to collect 3D information on 
the mineralized collagen fibrils in each voxel, 
while the mapping stays 2D*. 

It would be desirable to extend this to 
six dimensions by enabling 3D macroscopic 
mapping of the specimen. But the reconstruc- 
tion of such 6D data is a formidable numeri- 
cal challenge and can be solved only through 
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the use of certain approximations and 
substantial computing power. In con- 
ventional computed tomography, vol- 
umes are reconstructed from 2D images 
taken in the x-y plane for many rotation 
angles g around a given axis (Fig. 1). For 
SAXS tomography, 2D SAXS data must 
be collected at each position of the x-y 
plane and at many rotation angles, not 
just around one axis but around many 
axes. This results in a huge number of 
measurements and requires massive 
computational effort. 

To make such efforts tractable, Liebi 
et al. (page 349) take advantage of certain 
symmetries in the mineralized collagen 
fibrils of bone. More specifically, they 
assume that the arrangement of mineral 
particles is rotationally symmetrical — 
it looks the same after a certain amount 
of rotation — around the direction in 
which collagen molecules align in each 
fibril. This symmetry imposes con- 
straints on the SAXS patterns that aid 
the reconstruction of 3D images using 
a procedure sometimes called tensor 
tomography. 

Schaff and colleagues’ approach 
(page 353) assumes that the SAXS sig- 
nal varies slowly with rotation angle. 
This means that fewer rotation angles are 
needed during data collection, because 
data values between rotation angles can 
be interpolated. The authors used their 
technique to study dentine in the interior 
of a tooth. Both techniques’” produce 
3D pictures of macroscopic specimens, 
in which each voxel contains an arrow 
that represents the direction of the 
predominant orientation of the mineralized 
collagen fibrils in the voxel. 

A previous attempt’ to develop 3D scanning- 
SAXS tomography was devised for materials 
with structures that have rotational symmetry 
around one axis, such as composite materi- 
als based on parallel fibres. This approach is 
similar to that of Liebi and colleagues, but it 
assumes that all the fibres point in the same 
direction throughout the specimen, which is 
not true for bone. Another recently reported 
approach” applies normal scanning SAXS to 
a series of consecutive thin bone sections cut 


from one block; by assembling the resulting 
series of scanning SAXS data, a full 3D image 
was reconstructed. The obvious disadvantage 
of that technique compared with the cur- 
rently reported ones is that the bone sample 
was destroyed. 

Both Liebi et al. and Schaff et al. collected 
more than one million SAXS patterns for their 
reconstructions — equating to terabytes of 
data, an impressive amount. Recording them 
using a synchrotron X-ray source took about 
a day, and the computations needed for one 
tomogram required several days of computing 
time. This is manageable for proof of principle 
of the techniques, but would be prohibitive for 


aS 


clinical studies, for which many tomograms 
would be required. 

Nevertheless, the feasibility of the techniques 
has been demonstrated. With the progress 
currently foreseeable in the development of 
X-ray sources, detectors and computing power, 
one can therefore expect SAXS tomography to 
become an important tool for the analysis of 
bone, dentine and other mineralized tissues in 
biological and medical studies. m 


Peter Fratzl is in the Department of 
Biomaterials, Max Planck Institute of Colloids 
and Interfaces, Potsdam 14424, Germany. 
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Homed to the hideout 


Some Staphylococcus aureus bacteria are thought to survive standard antibiotic 
treatment by ‘hiding’ in host cells. But an antibody- antibiotic conjugate has been 
developed that targets these bacteria in mouse models. SEE ARTICLE P.323 


WOLF-DIETRICH HARDT 


he pathogenic bacterium Staphylococcus 
aureus causes thousands of deaths each 
year. Therapy is sometimes unsuccess- 
ful, partly because antibiotic-resistance 
genes are spreading worldwide. However, 
even strains of S. aureus that lack resistance 
genes are often difficult to kill with avail- 
able antibiotics; it has been suggested that the 
bacteria ‘hide’ inside host cells. This hypoth- 
esis inspired Lehar et al.', who, on page 323 
of this issue, present a construct in which an 
antibiotic is linked to an antibody that binds 
to the pathogen’s surface. Alone, this ‘prodrug’ 
is inactive, but when prodrug-coated bacteria 
enter host cells, enzymatic activity releases the 
antibiotic. In mouse models of S. aureus infec- 
tion, this strategy was strikingly more potent 
than standard antibiotic treatment. 
Antibiotics are a pillar of modern medicine, 
but they are not effective in all cases. There 
are at least three explanations for this. First, 
important pathogenic bacteria, including 
many S. aureus strains, have acquired resist- 
ance against standard antibiotics”. Second, the 
pathogen may hide in host sites that cannot be 
reached by antibiotic molecules, or where the 
environmental conditions, such as high acidity, 
render the antibiotics inactive. And third, in 
certain circumstances, bacteria can switch to 
a ‘persistent lifestyle that makes them insensi- 
tive to antibiotics’. The switch to persistence is 
still not completely understood but can occur 
in some pathogens when they enter host cells. 
In S. aureus, a range of virulence factors 
manipulate host-cell processes, prohibit effi- 
cient immune responses and fuel infections 


in wounds, the bloodstream and other sites. 
In the absence of antibiotic-resistance genes, 
several different classes of antibiotic are 
used to treat S. aureus infections; rifampicin 
antibiotics are sometimes employed to target 
intracellular reservoirs of the bacterium. How- 
ever, these classic antibiotics are often unable 
to cure the infection. 

Lehar and colleagues speculated that this 
failure is due to either insufficient accessibility 
of the antibiotic or insufficient activity 
against intracellular S. aureus. In an attempt 
to overcome these problems, the researchers 
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first generated a rifampicin derivative 
(a ‘rifalogue’) with altered physicochemical 
properties that gives superior activity against 
S. aureus cells that have switched to a persis- 
tent lifestyle. Next, they identified an antibody 
that tightly binds to sugar structures found on 
the surface of all S. aureus strains tested. Then 
they covalently joined these two components 
by using a chemical bridge that can be broken 
by protease enzymes that are present at the 
intracellular sites where S. aureus is thought to 
hide out (Fig. 1). Strikingly, in a mouse infec- 
tion model, this antibody-antibiotic conjugate 
(AAC) was much more effective at reducing 
pathogen loads than two conventional anti- 
biotics currently used to treat recalcitrant 
S. aureus infections. 

This approach is reminiscent of antibody- 
targeted prodrug strategies that are currently 
used in cancer therapy’, and these proof-of- 
principle data suggest that targeted antibiotic 
delivery is a promising strategy for fighting 
obstinate intracellular pathogens. It remains 
to be seen whether AACs are as efficient at 
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Figure 1 | Targeted intracellular antibiotic release. a, Staphylococcus aureus infections are notoriously 
difficult to treat. It is thought that this is because the bacteria enter host cells and ‘hide’ in intracellular 
compartments that conventional antibiotics cannot reach or where they are inactive. b, Lehar et al! 
covalently linked an antibiotic derivative, called a rifalogue, to an antibody that binds to components of 
the S. aureus cell wall. This prodrug coats the bacterial cell surface but remains inactive until the bacteria 
enter the host cell. There, protease enzymes cleave the linker region, releasing the active antibiotic, which 


then kills the bacteria. 
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treating bacterial infections in humans as they 
are in mice, especially in chronically infected 
patients, who often already have antibodies 
against S. aureus. Such antibodies may shield 
the bacterial surface from AAC binding, and 
therefore interfere with the targeting of the 
prodrug. Moreover, because the antibiotic 
makes up only around 1% (by mass) of the 
current construct, the AAC would have to be 
applied at the equivalent of more than a gram 
per dose for an adult human patient. This might 
be improved in the future by replacing the anti- 
body with smaller surface-targeting entities. 
Why is the AAC approach so much more 
effective than standard antibiotics? One rea- 
son is that the rifalogue is more efficient than 
rifampicin at killing persistent S. aureus cells. 
Another is that the kinetics of drug distribution, 
excretion and inactivation seem to be favour- 
ably affected by its fusion to the antibody. Coat- 
ing bacterial cells with the antibody-bound 
prodrug may also steer the bacteria to be taken 
up into intracellular compartments (lysosomes) 
that have high levels of the enzymes needed to 
release the antibiotic’. And accumulation of 


ASTROPHYSICS 


the AAC on the pathogen’s surface may cause 
particularly high local concentrations of the 
bacteria in the intracellular hideout. It remains 
to be seen which of these mechanisms account 
for the in vivo potency of the AAC. 
Compared with conventional antibiotic 
therapy, the prodrug approach is likely to 
reduce both the emergence of antibiotic resist- 
ance (by reducing the exposure of other bac- 
teria to the active drug) and the disruption of 
the body’s normal communities of microorgan- 
isms. There is still plenty of scope for optimiz- 
ing the targeting moieties and the chemical 
bridges®. Moreover, the strategy may allow 
researchers to revisit older antimicrobials that 
were not developed for therapy because they 
had unfavourable pharmacokinetics or toxic- 
ity. The AAC approach could also expand our 
arsenal against other notorious intracellular 
pathogens, such as Mycobacterium tuberculosis. 
Alternative strategies to tackle the growing 
problem of antibiotic resistance are also 
emerging. These include antibiotics that spe- 
cifically target persistent cells’, agents that 
stimulate the host’s antimicrobial defences 


Growing planet 
brought to light 


Thousands of extrasolar planets have been discovered, but none is a planet in its 
infancy. Observations have finally been made of a young planet growing in its 
birthplace — opening the way to many more such discoveries. SEE LETTER P.342 


ZHAOHUAN ZHU 


places is extremely challenging, because 

actively forming planetary systems are 
far away and obscured by dust. On page 342 
of this issue, Sallum et al.' report the use of a 
new technique to detect an emission signal 
from a growing planet. This discovery has far- 
reaching implications for our understanding 
of the planet-formation process and of the 
properties of young planets. 

When a star is born, a flat rotating disk of 
gas and dust forms around it, known as the 
circumstellar disk. This disk continuously 
transports dust and gas inward to feed the 
young star for millions of years, a process 
known as disk accretion. Planets are thought 
to form from the leftover material from this 
disk. Earth, for example, was born in the 
circumstellar disk surrounding the young Sun 
4.6 billion years ago. But little is known about 
how microscopic dust particles can grow 14 
orders of magnitude bigger to become a giant 
planet within the relatively short lifetime of the 


Pee: young planets in their birth- 


disk. Finding young planets in circumstellar 
disks should provide important clues about 
when, where and how young planets are born. 

But finding planets is difficult because 
they are small and dim. The presence of most 
known planets has therefore been inferred 
from observations of the stars around which 
they revolve. For instance, the Kepler satel- 
lite’ has discovered more than 1,000 planets 
by measuring the tiny dimming of stellar light 
that occurs when a planet passes in front of 
its star. 

Such methods cannot be used to find young 
planets around young stars, because such stars 
are highly active and the light they emit is 
variable. Most attempts to find young planets 
use 10-metre-diameter optical telescopes to 
directly image planets in circumstellar disks. 
Disks with large cavities are particularly tar- 
geted’, because such cavities are thought to be 
opened up by giant planets in orbit around the 
central star. 

In 2012, a protoplanet candidate 1,000 times 
fainter than its host star was discovered* ina 
system called LkCa 15 (Fig. 1). The central 
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to augment antibiotic therapy*”, or harmless 
‘biocontrol agents that colonize the host and 
inhibit pathogen growth”. We can hope that 
such approaches, alongside the AAC strategy 
presented by Lehar et al., will boost our 
capacity to treat bacterial infections. m 
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star of this system is similar to our Sun, but is 
only 2 million years old. The Sun-like star has a 
circumstellar disk with a cavity 50 astronomi- 
cal units in radius (1 au is the distance from 
Earth to the Sun). 

The protoplanet candidate, called LkCa 15 b, 
resides inside the cavity, 16 au away from the 
central star. But the nature of LkCa 15 b is 
unclear — it is redder than would be expected 
for a young planet. Several young planet 
candidates”® have since been discovered in 
other circumstellar disks, and they are all 
quite red. This has led to the hypothesis that 
the detected red objects are young planets 
with circumplanetary disks. When a planet 
is born, a rotating circumplanetary disk of 
gas and dust forms around it, similar to the 
circumstellar disks around young stars. As 
the accreting disk feeds the nascent planet, it 
releases energy and becomes bright. The emis- 
sion from such a disk should be redder than the 
planet itself”. 

In their study, Sallum et al. searched for a 
signature of young planets: Ha photons, which 
are emitted from hydrogen atoms only when 
a circumplanetary disk accretes onto a planet. 
If a young planet has strong magnetic fields, 
the fields form a large magnetosphere around 
the planet, which can truncate the circum- 
planetary disk”*. The material in the disk 
therefore has to follow the planet’s magnetic 
fields to accrete onto the planet. During this 
accretion process, the magnetosphere can be 
as hot as 10,000 kelvin, which is what causes 
hydrogen atoms to emit Ha photons’. 

Although the emission of Ha photons has 
been widely observed when circumstellar 
disks accrete onto young stars, Sallum and col- 
leagues are the first to directly image accreting 


LkCa 15 


Figure 1 | Circumplanetary-disk discovery. The young star LkCa 15 is surrounded by a disk of dust and 
gas. Sallum et al.' report that a young planet (LkCa 15 b) is growing in a gap in that circumstellar disk, and 
that two other potential young planets (LkCa 15 c and d) also reside within the gap. Disks of dust and gas 
also form around young planets (inset), providing material for them to grow continuously. When material 
from a circumplanetary disk follows the magnetic fields of young planets (blue curves) to be accreted onto 
those planets, it produces light known as Ha photons. The authors report that LkCa 15 b is an Ha emitter. 
This graphic is based on supercomputer simulations of the gas distribution in the LkCa 15 system; the 
central star and planets are not shown to scale. (Graphic modified from images provided by Z. Zhu.) 


circumplanetary disks around young planets 
using Ha photons. To do this, they used a filter 
that allows only Ha photons to reach their tele- 
scope’®. The authors report that LkCa 15 bis an 
Ha emitter, providing strong evidence that it 
is a young planet with a circumplanetary disk 
still accreting onto the planet. Furthermore, 
they found two other objects inside the cavity 
of the LkCa 15 system, although these do not 
seem to be Ha emitters. By combining infor- 
mation from observations made over several 
time periods with data from the initial discov- 
ery in 2012, Sallum et al. determined the orbits 
of two of the young planet candidates. 

The researchers’ discovery provides 
stringent constraints on planet-formation 
theories. For example, such theories now 
have to explain how a giant planet can form 
15-16 au from its star within 2 million years, 
and still be growing after this time. Another 
implication of the findings is that a young plan- 
et’s magnetic fields need to be at least 20 times 
stronger than Jupiter’s current magnetic fields 
to truncate the accreting circumplanetary disk. 
This in turn implies that the internal motion of 
young giant planets is much greater than that of 
the giant planets in the Solar System, and pro- 
vides an indirect probe of the internal structure 
of such planets. 

Both the red colour and the Ha emission 
from LkCa 15 b can be explained by the pres- 
ence of an accreting circumplanetary disk, 
but some caveats should be kept in mind. Our 
knowledge of Ha emission from accretion 
disks builds on data from disks around young 
stars that are hundreds of times more massive 
than planets. Sallum and co-workers therefore 
extrapolate the known relationship between 
Ha flux and disk-accretion rate to a com- 
pletely new size scale. Measurements of other 
accretion tracers would be desirable, such as 
‘continuum emissions at ultraviolet and optical 


wavelengths’. The nature of the two sources 
that do not emit Ha photons also remains 
unclear. Follow-up observations, especially at 
mid-infrared and submillimetre wavelengths, 
are needed to clear up these issues. 
Nevertheless, the authors have demon- 
strated a powerful technique to find young 
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planets in circumstellar disks, one that will 
discover many such planets in the future. 
This would potentially allow the distribu- 
tion and occurrence of young planets to be 
determined with a comparable accuracy to 
that for the mature exoplanets discovered by 
the Kepler satellite. Such an understanding of 
the young planet population will shed light on 
the decades-old problem of planet formation, 
and reveal how young planetary systems can 
evolve into older ones suchas our Solar System, 
billions of years after they were born. = 
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On the crest of 
becoming vertebrate 


The discovery of cells in an invertebrate that share several features with 
vertebrate neural-crest cells provides insights into how this vital vertebrate cell 
population might have evolved. SEE LETTER P.371 


MARIANNE E. BRONNER 


he evolution of vertebrates is intimately 

linked to the advent of an embryonic 

cell population called the neural crest. 
Neural-crest cells arise in the central nervous 
system (CNS) and then invade the periphery 
of the vertebrate embryo, where they differen- 
tiate to form a wide range of cell types, from 
facial cartilage and bone, to pigment cells of the 
skin, to neurons and glial cells of the periph- 
eral nervous system. The evolution of these 
features is thought to have imbued vertebrates 
with their predatory ability’, facilitating their 
success on Earth. Although all vertebrates have 
neural-crest cells, how this population evolved 
has remained a mystery. In this issue, Stolfi 
et al.” (page 371) report that a type of neuron 
in an ascidian (sea squirt) — an invertebrate 


filter feeder that is a close relative of vertebrates 
— has intriguingly similar features to neurons 
derived from the vertebrate neural crest. 

The neural crest is characterized by its 
origin in the CNS, its migratory behaviour and 
its ability to differentiate into many cell types 
(multipotency). Although all animals of the 
chordate phylum, which includes vertebrates, 
have a similar body plan (including a dorsal 
CNS; a structure called the notochord that 
runs down the midline of the embryo; and a 
segmented trunk), invertebrates lack cells 
that have all the characteristics of the neural 
crest. So far, no intermediate cell type — which 
would be expected to originate from the CNS 
and become migratory — has been identified 
in invertebrates. 

Stolfi and colleagues investigated the origin 
of bipolar tail neurons (BTNs) in the ascidian 
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Figure 1 | Evolution of the neural crest. This simplified phylogenetic tree depicts the evolution of 

the chordate lineage. An embryonic cell population called the neural crest, which migrates extensively 
and forms many cell types, arose with the advent of vertebrates. Whereas jawless and jawed vertebrates 
possess neural-crest cells, this population is not present in cephalochordates or urochordates. Stolfi 

et al.” have demonstrated that a cell population in the urochordate Ciona intestinalis shares some 
characteristics with vertebrate neural-crest cells. This may be an intermediate cell population from which 


the neural crest evolved. (Drawings taken from ref. 8.) 


Ciona intestinalis. They found that BT'Ns 
arise from precursor cells that originate in the 
developing CNS and migrate through adjacent 
tissues before differentiating to form mature 
neurons. Furthermore, the authors discovered 
that BTNs have a similar function to sensory 
neurons in vertebrates. 

This study is a good complement to previous 
work’ demonstrating that precursors to pig- 
ment cells are also present in ascidians. These 
precursors normally remain in the CNS, but can 
be induced to migrate through misexpression 
of just one gene, Twist. Taken together, these 
two studies point to the intriguing hypothesis 
that the evolution of vertebrates might have 
involved precursor cells in the CNS ofa chor- 
date ancestor gaining the ability to form many 
cell types — having already developed differen- 
tiation programs for forming some neural-crest 
derivatives, including sensory neurons and pig- 
ment cells. Moreover, by demonstrating that 
BTN precursors migrate from the CNS before 
differentiating, Stolfi et al. provide evidence that 
these cells have analogous traits to two major 
characteristics of the vertebrate neural crest. 

The authors also found that BTN precursors 
have similar genes to those that encode the ver- 
tebrate transcription factors Neurogenin and 
Islet, which are both required for the formation 
of sensory neurons. However, the cells do not 
express all the genes expressed by the neural 
crest, raising the possibility that BT'N precur- 
sors represent an intermediate cell population 
capable of some, but not all, neural-crest-like 
behaviours. 

In contrast to ascidians, more-primitive 
chordates such as amphioxus, a cephalochor- 
date, lack any precursor cells with neural-crest- 
like characteristics. These species do, however, 
have neurons and pigment cells’. Moreover, 
the genomes of all invertebrate chordates, 
including amphioxus’, harbour genes that 
are similar to those involved in neural-crest 
formation in vertebrates. Thus, vertebrate 


evolution did not require the invention of new 
genes. Rather, progressive changes that rewired 
the regulation of gene circuits by using existing 
factors in new ways probably permitted each 
transition, from cephalochordates that have no 
neural crest, to urochordates such as ascidians 
that have an intermediate cell population with 
some neural-crest-like characteristics, to the 
base of vertebrate evolution, with the advent 
of bona fide neural-crest cells (Fig. 1). 

Key questions remain to be answered. For 
example, it is not clear whether the gene- 
regulatory networks in the sensory-neuron 
precursors of ascidians are similar to those 
of vertebrates. Because signalling pathways 
that mediate differentiation are thought® to 
be crucial for conferring multipotency on the 
neural crest, it will be interesting to determine 


how these signalling pathways came under 
the regulatory control of the genes that induce 
the formation of neural-crest cells in ver- 
tebrates. Finally, although C. intestinalis is 
useful for experimental analysis, it is highly 
derived — its genome is much simpler than 
that of other chordates. Thus, analogous stud- 
ies should be performed in other, less-derived 
ascidians, and in other chordates at different 
positions on the evolutionary tree. 

The idea that cell-differentiation programs 
have come under the progressive control of 
neural-crest genes during vertebrate evolu- 
tion is not new’. It has long been thought 
likely that evolutionary precursors of neural- 
crest cells lacked the multipotency that defines 
this cell population in vertebrates. Stolfi and 
colleagues’ study neatly demonstrates the 
existence of an intermediate cell type that might 
have gained this ability, enabling the evolution 
of the neural crest. = 
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Organ-seeking vesicles 


An analysis reveals that cancer cells remotely prepare distant sites for 
tumour spread in an organ-specific manner, by deploying organ-seeking 


extracellular vesicles. SEE ARTICLE P.329 


JANUSZ RAK 


he metastatic dissemination of cancer 

cells from their site of origin through the 

bloodstream to distant organs is a major 
cause of cancer-related deaths. This process is 
not random’; instead, certain populations of 
cancer cells preferentially seek out and colonize 
specific organs’, under the control of a range of 
molecular programs’. Such homing implicitly 
involves interactions between cancer cells that 
escape the primary tumour, sometimes known 
as seeds, and the microenvironment, or ‘soil; of 
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target sites’. But less intuitive is the discovery by 
Hoshino et al.’, described on page 329 of this 
issue, that seeds can influence the soil before 
their arrival, sending out extracellular vesicles 
called exosomes that precondition specific 
organs for metastatic invasion. 

There is growing support for the provocative 
notion that a build-up of systemic responses 
to a primary tumour might precede, and 
even enable, the eruption of metastatic can- 
cer. These responses might involve complex 
alterations in the body’s vascular, coagulation 
and inflammatory systems — for example, 


cancer-related changes in the composition of 
soluble proteins, in cell populations’ or in the 
characteristics of exosomes’ in the blood. 

Hoshino et al. define exosomes as small 
extracellular vesicles° — membrane-bounded 
compartments that transport proteins, lipids 
and nucleic acids’ from one cell to another, 
and which can travel considerable distances in 
bodily fluids or the bloodstream. This infor- 
mation-transfer process has attracted con- 
siderable interest in cancer research, because 
some extracellular vesicles carry cancer- 
causing genes called oncogenes, or oncogenic 
proteins that promote cancer formation and 
disease progression®. 

The involvement of extracellular vesicles, 
including exosomes, in metastasis has been 
studied for some time””®, and contributes 
to several key events that prepare a distant 
site for colonization — a process called pre- 
metastatic niche formation"’. For example, in 
a mouse model of melanoma, contact between 
exosomes and the capillary wall triggers vascu- 
lar permeability, which enables cancer cells to 
escape from the blood vessel into a new site’. 
In addition, these exosomes can transfer the 
oncogenic MET receptor protein to circulat- 
ing blood cells called myeloid cells, altering 
the cells’ behaviour such that they condition 
premetastatic sites for subsequent colonization 
by cancer cells’. In pancreatic cancer, circulat- 
ing exosomes transfer migration inhibitory 
factor protein to immune cells called Kupffer 
cells in the liver, triggering a cascade of events 
that results in premetastatic niche formation”. 

Although these results indicate that 
exosomes can promote metastasis in general, 
whether and how exosomes are involved in 
organ-specific metastasis has not been exten- 
sively investigated. To explore this question, 
Hoshino et al. asked whether cancer-cell types 
known to preferentially home to the lung, 
liver, brain or bone might produce exosomes 
that selectively interact with the same organ. 
Remarkably, this is precisely what they 
observed. When exosomes from cancer cells 
were injected into mice, they became lodged 
in the organ to which those cancer cells are 
prone to metastasize. Furthermore, the organ- 
seeking exosomes interacted with different cell 
types. For instance, exosomes that targeted the 
lung became lodged in the epithelial cells that 
line the organ’s interior, whereas liver-targeting 
exosomes entered Kupffer cells. 

Hoshino et al. injected mice with exosomes 
followed by cancer cells from the same cell 
line, and demonstrated that the exosomes 
promoted organ-specific metastatic growth. 
They then made a tantalizing observation 
— exosomes taken from breast-cancer cells 
that metastasize to the lung could redirect 
another cancer-cell population to dissemi- 
nate in the lung, when it would normally 
home to the bone. This discovery strengthens 
the notion that the metastatic characteristics 
of cancer cells are not autonomous, but can 
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Figure 1 | Paving the way for organ-specific metastasis. a, Small extracellular vesicles called exosomes 
bud off from cancer cells in a primary tumour and enter the bloodstream, transporting proteins, lipids 
and nucleic acids to distant cells in the body. Hoshino et al.’ report that exosomes derived from different 
cell types within a mixed population of cancer cells can display different integrin proteins on their 
surface. This integrin profile promotes adhesion with cells at specific target sites — exosomes displaying 
the integrin a,$, preferentially interact with cells in the lung, whereas a,8; directs exosomes to the liver. 
b, The contents of the exosome trigger cellular changes in the target organ that condition the site for 
metastasis. Thus, exosomes promote organ-specific invasion and metastatic growth of the cancer-cell 


type from which they originated. 


instead be influenced by external factors. 

The authors provide several clues to how 
exosomes orchestrate organ-specific metasta- 
sis. They found that exosomes targeting dif- 
ferent sites displayed different cell-adhesion 
receptor proteins called integrins on their 
surface. The integrin profile of each exosome 
subtype facilitated its uptake into organs in 
which an abundance of ligand for that integ- 
rin was produced. For instance, a,8; integrin 
directed exosomes to the liver, whereas a,8, 
promoted homing to the lung (Fig. 1). Fur- 
thermore, inhibiting the exosomal expression 
or binding of integrins limited organ-specific 
metastasis. Finally, the authors found evidence 
that invasion of target organs by exosomes trig- 
gered the production of $100 proteins, which 
promote inflammation and cell migration, and 
activation of the protein Src — responses that 
precondition host cells for metastasis. 

These fascinating observations expand our 
understanding of organ-specific metastasis. 
However, further investigation is required to 
establish whether and how this knowledge can 
be put to practical use. The authors demon- 
strate that integrin expression might predict 
metastatic spread, pointing to the possibility 
that exosomal integrin profiles could be used 
in cancer diagnostics. Their data also indicate 
that integrin inhibitors might curtail meta- 
static spread to specific organs. But in many 
cases, advanced cancers disseminate to several 
sites’, limiting the potential of therapeutics 
that work in an organ-specific manner. 


It is also worth considering that the molecular 
pathways that induce metastasis, both exosome- 
dependent and -independent, are probably 
extremely diverse. As such, they might be 
triggered by many context-specific factors: 
the activation of differentiation pathways in 
cancer cells; the emergence of a particular 
molecular subtype within a tumour; therapeu- 
tic interventions and more. For example, the 
incidence of brain metastasis differs between 
molecular subtypes of breast cancer, and tends 
to be higher in those driven by the oncogenic 
protein ERBB2, even after effective treatment 
with ERBB2 inhibitors’*. Whether, and how, 
ERBB2, its antagonists and the therapies used 
to treat ERBB2-driven cancers might influ- 
ence the emission of organ-seeking exosomes 
is unknown, and is of great interest. Similarly, 
inflammation, abnormal clotting and other 
cancer-associated changes in physiology might 
interfere with the organ-seeking mechanism 
of exosomes and cells — and must be taken 
into account when analysing routes of meta- 
stasis. Thus, much remains to be understood 
about the fascinating part that organ-specific 
exosomes might play in fertilizing the meta- 
static soil in different human cancers. m 
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Boost for movement 


By electrically stimulating the motor neurons of rats that have spinal-cord 
injury, in bursts that are attuned to the times at which the neurons receive 
voluntary motor commands, the animals’ recovery can be improved. 


RANDOLPH J. NUDO 


Pp eople with spinal-cord injury face a host 


of challenges, including sensory, bowel, 

bladder and sexual dysfunction, paraly- 
sis and weakness. Although rehabilitation 
can help to improve motor and sensory func- 
tion, recovery is limited. But writing in Pro- 
ceedings of the National Academy of Sciences, 
McPherson et al.' show that rats with spinal- 
cord injury can recover substantial motor 
ability when treated with a new type of electri- 
cal-stimulation therapy. 

Rehabilitation from spinal-cord 
injury relies on the damaged nerv- 
ous system adapting over time. For 
example, when a person with motor 
defects in their arm repeatedly prac- 
tises hand grasps, their ability to do 
this improves as a result of changes 
in neural circuitry. This alteration 
in nervous communication patterns 
and connections to regain function 
is called neuronal plasticity. Thera- 
peutic approaches that encourage 
neuronal plasticity therefore have the 
potential to improve recovery when 
combined with more-conventional 
rehabilitation strategies. 

In one such approach, known 
as electrical stimulation, a barrage 
of electrical excitation is given to 
undamaged spinal-cord projections 
(called fibres) and the neurons that 
they target, boosting the body’s 
own weak electrical inputs to these 
circuits. In addition to promoting 
neuronal plasticity, this technique can 
help to uncover latent motor func- 
tions that are presumably inactive 
owing to insufficient levels of excita- 
tion. For example, it has been used to 
demonstrate’ that even people with 
‘complete’ spinal-cord injuries (who 
are paralysed below the level of the 
injury) retain some latent functional 
potential. 


Interneuron S$ ee 


In the mid-twentieth century, the neuro- 
psychologist Donald Hebb predicted that 
the synaptic connections between neurons 
can be altered, becoming stronger or weaker 
depending on the timing of their activation’. 
If the presynaptic neuron is activated before 
the postsynaptic neuron, then the synaptic 
connection is strengthened, increasing acti- 
vation of the postsynaptic neuron in response 
to signals from the presynaptic neuron. This 
theory is summarized in a modern maxim’: 
“cells that fire together, wire together” Thanks 
to improvements in microelectronics, it is now 


Corticospinal 
neuron 


Spinal cord 


Motor 
neuron 
Forelimb 
muscle 
Electrical 
stimulus 
Muscle 


recording 
Interface 


Figure 1 | Precisely timed spinal-cord stimulation. Voluntary- 
movement commands are conveyed to the spinal cord by corticospinal 
neurons that originate in the motor regions of the brain's cerebral cortex. 
These neurons terminate on spinal-cord interneurons, which in turn 
project to motor neurons that excite muscle. After spinal-cord injury, 
corticospinal neurons are damaged, impairing limb function. However, 
some neurons can escape injury. McPherson et al.’ investigated recovery 
from spinal-cord injury in rats. An integrated electronic interface 
recorded electrical activity in limb muscle (indicating contraction of 

the muscles) and applied a carefully timed electrical stimulus to the 
spinal cord. The stimulus arrived shortly after the weakened voluntary 
command from the surviving corticospinal neurons. This excitation 
helped to restore limb motor function, possibly by strengthening the 
neuronal connections in the spinal cord. (Adapted from ref. 1.) 
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possible to create small devices that record 
electrical biological signals and then use those 
signals to trigger rapid electrical stimulation in 
the nervous system, allowing Hebb’ theory to 
be tested directly in living organisms. A pivotal 
2006 experiment’ in monkeys demonstrated 
that artificial coupling of areas in the brain’s 
cerebral cortex can alter the function of the 
connected neurons. More-recent studies*” 
have begun to explore whether timed stimu- 
lation can strengthen synaptic connections in 
the brain and spinal cord. 

McPherson and colleagues experimented 
with a variation on this technique in injured 
rats. They recorded an electrical signal from 
muscle that indicated that the rat was in the 
process of contracting that muscle, and used 
the signal to trigger an electrical-stimulation 
pulse in the spinal cord (Fig. 1). They hypoth- 
esized that, in this way, voluntary commands 
to induce muscle contraction (which are 
transmitted from the cerebral cortex and per- 
haps from other brain centres) would arrive 
at motor neurons in the spinal cord shortly 
before the neurons were activated 
by the stimulation pulse. This would 
strengthen the synaptic connections 
in the spinal cord. Once the connec- 
tions had been strengthened, the 
voluntary commands from the brain 
might be sufficient to drive muscle 
contraction on their own. 

First, the authors injured one 
side of the spinal cord of rats. This 
resulted in paralysis of the forelimb 
on the injured side, followed by lim- 
ited natural recovery. The research- 
ers then inserted microwires into 
the spinal cord, below the level of 
the injury. Such microwires can 
stimulate neurons, and even move- 
ment, with extremely low electrical 
currents (approximately ten times 
lower than that needed for neuronal 
stimulation at the surface of the spi- 
nal cord), allowing specific activa- 
tion of motor neurons”’. Wires were 
also implanted in forelimb muscles 
to allow recording of electrical sig- 
nals and were used to trigger spinal- 
cord stimulation. 

Six weeks after spinal-cord injury, 
the rats were unable to reach and 
grasp a small food pellet with the 
affected forelimb. All the animals 
were severely impaired, achieving 
only about 13% of their pre-injury 
scores in this reaching task. Over 


the following 13 weeks, they underwent 
rehabilitation training, practising reaching 
for 30 minutes each day, 5 days per week. One 
group of rats received only rehabilitative train- 
ing; a second group received rehabilitation and 
electrical stimulation in the spinal cord that 
was triggered by muscle activity; a third group 
received rehabilitation and electrical stimula- 
tion in the spinal cord that was not linked to 
the timing of muscle activity. 

Rats in the second group demonstrated 
substantial recovery. At the end of the regime, 
they had regained 63% of their pre-injury 
motor ability. By contrast, rats in the first 
and third control groups made much smaller 
improvements. Perhaps most importantly, 
the second group maintained motor function 
after a three-week follow-up period. By tim- 
ing spinal-cord stimulation to coincide with 
the ‘sweet spot’ for motor-neuron activation, 
synaptic connections seemed to have been 
strengthened over time, allowing enhanced 
motor recovery. 

Although spinal-cord circuitry is complex, 
and the exact mechanisms that underlie this 
dramatic effect remain to be fleshed out, 
McPherson and colleagues’ results have sub- 
stantial clinical relevance. This study is an 
example of therapeutic strategies for neu- 
ral repair that go beyond simply exciting the 
injured nervous system with a blast of electrical 
stimulation. By mirroring the timing rules that 
the nervous system normally uses to enhance 
the strength of synaptic connections, it might 
be possible to guide neuronal plasticity in a 


more functional and adaptive way after injury. 

Nevertheless, many theoretical and practical 
issues need to be addressed before such 
approaches are attempted in humans. For 
example, the safety of chronic electrical stimu- 
lation using electrodes in the spinal cord must 
be thoroughly tested. The timing of the stimu- 
lation may also need to be adjusted on the basis 
of differences in nerve-fibre size and speed of 
conduction in the human nervous system 
compared with that of rats. Nonetheless, the 
translation of techniques that exploit synap- 
tic plasticity from bench to bedside is now 
a little closer. m 
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Ecosystem responses 
to climate extremes 


Extreme drought or wet conditions have now been found to strongly influence the 
vegetative development of ecosystems. Semi-arid regions are most affected — 
raising concerns about their vulnerability to long-term drought in the future. 


ANJA RAMMIG & MIGUEL D. MAHECHA 


xtreme climatic conditions such as 
Hee or heatwaves are likely to inten- 

sify in the next few decades’. Long-term 
observations” of past decades suggest that 
characteristic recurrence frequencies, inten- 
sities and durations of certain extreme events 
have already increased noticeably. One press- 
ing question is whether key ecosystem services, 
such as the capacity of ecosystems to accumu- 
late carbon, are affected by extreme events’. 
The potential of ecosystems to accumulate 
carbon is intimately related to the phenology of 
vegetation* — essentially, all the characteristic 


periodicities in an organism’s life, such as 
the annual cycles of plant leaf development. 
Writing in the Journal of Geophysical Research, 
Ma et al.° describe how climate extremes 
modify the seasonal vegetation development 
of different ecosystems. 

Maand colleagues’ study relies mainly on a 
measure called the enhanced vegetation index. 
This is used to determine whether a region of 
Earth contains live green vegetation, and can 
be derived from spectral data that have been 
collected by satellites for more than a decade. 
Vegetation indices of this kind are often 
used as indicators of vegetation productivity, 
although they cannot really be directly 
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50 Years Ago 


In the last of nine papers presented 
to ... the British Association for the 
Advancement of Science ... Dr. K. 
Adam discussed the responsibility 
of television: this responsibility 

has since been sharpened by 

the continuing discussions on 

the feasibility and the scope for 

a ‘university of the air’ ... itis 
proposed that only the major 
conurbations, containing 60-70 
per cent of the population, should 
be covered by broadcast television 
... coverage in this way does not 
mean that the public beyond the 
range of television and sound 
transmission would be unable to 
enrol at the University of the Air ... 
all the programmes provided for 
television by the University should 
be recorded and prints of the 
original transmission can readily be 
circulated anywhere for viewing ... 
All the courses would be supported 
by carefully prepared lecture notes 
and reading lists, and each week all 
enrolled students would be required 
to submit exercises for marking and 
comment by tutors. 

From Nature 20 November 1965 


100 Years Ago 


The little attention given to science 
in education and in the public mind 
has been the theme of many essays 
and addresses ... science is usually 
regarded as suitable for study by 
aselect few only, and not as an 
essential part of all modern life and 
thought ... We do not fora moment 
suggest that the end ofall education 
should be preparation for scientific 
careers; neither do we ask that men of 
letters, statesmen, and administrative 
officers of departments of State 
should all be scientific experts... 
Our claim is that everyone — from 
elementary-school pupil to college 
don — should be made acquainted 
with appropriate outlines of scientific 
work and thought. 

From Nature 18 November 1915 
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Figure 1 | Hummock grassland during dry and wet growth seasons. Some ecosystems, such as hummock grasslands in southeastern Australia, compensate 
for poor growth during dry periods by increasing growth during wet periods. Ma et al.” report that such ecosystems have until now been less vulnerable to 


drought than are croplands or pasture. 


interpreted as such. Nevertheless, such indices 
reveal the annual cycle of vegetation-canopy 
development within and across ecosystems. 

Drought stress is also commonly described 
by an index that allows an intuitive interpre- 
tation of an ecosystem’s water balance. In 
combination with temperature data, indices 
for vegetation and drought allow researchers 
to describe the complex interplay between 
vegetation and climate, as Ma and colleagues 
do in their work. 

In contrast to previous studies, which 
mainly examined the phenology of individual 
ecosystems on subcontinental scales, Ma et al. 
revealed the impacts of climatic extremes on 
the phenology of different ecosystem types 
on a continental scale. They show that two 
extreme years, 2002 (extremely dry) and 2010 
(extremely wet), had significant effects on 
vegetation phenology and productivity across 
southeastern Australia — a region character- 
ized by large rainfall and temperature gradi- 
ents, and which has vegetation types ranging 
from arid grasslands to forests. The authors 
find that 70% of the area had a prolonged 
growing season during 2010, whereas there 
was essentially no observable growing sea- 
son in 2002. These observations indicate the 
fundamental relevance of fluctuations in water 
availability for phenology. 

A crucial question emerges from this work: 
which vegetation types and ecosystems are 
most vulnerable to extreme conditions? By 
analysing the change in the vegetation index 
relative to changes in drought conditions for 
each location, Ma and co-workers suggest 
that cropland and pastures are most sensitive 
to fluctuations in climatic conditions. Natu- 
ral systems, such as the hummock grasslands 
investigated in the study, adapt to high year- 
to-year variations in rainfall by increasing 
growth during wet periods to compensate for 
poor growth during dry periods (Fig. 1). But 
agricultural systems cannot do this — they 


respond only to dry periods by reducing 
growth, which supports the view that agri- 
cultural systems might be strongly affected by 
extreme climatic events. 

The authors show that drought sensitivity 
peaks in semi-arid regions across different 
climate regimes. This is of interest, because 
semi-arid ecosystems cover approximately 
40% of the global land surface®. Ecosystems 
of this kind are often dominated by grasses or 
shrubs that have developed mechanisms to sur- 
vive long periods of dryness, and which grow 
rapidly when rainfall arrives. Ma et al. show 
that growth was almost completely dormant 
during 2002, and increased swiftly when the 
rains arrived during 2010. A remaining ques- 
tion is how long this behaviour can be main- 
tained during severe droughts that last for more 
than one season. In other words, how resilient 
are ecosystems to increasing fluctuations in 
climate? With additional pressure from human 
land-use activities, the resilience of many of 
these ecosystems may be ona knife-edge’. 

Ma and co-workers note that their findings 
have serious implications for estimates of the 
effects of extreme weather events on the global 
carbon cycle. But whether and how these 
results can be extrapolated to the global scale 
is unclear. Semi-arid regions have recently 
gained attention*” for their ability to take up 
large amounts of carbon dioxide from the 
atmosphere during favourable conditions, 
thereby driving the variability in the rate 
of increase in CO, levels in the atmosphere. 
But it is not known whether, or for how long, 
semi-arid regions and other terrestrial carbon 
sinks will continue to absorb so much CO,, nor 
what the magnitude of this carbon sink is — 
particularly given the uncertainty in predict- 
ing changes in the frequency and intensity of 
extreme events’. 

Overall, the authors’ results demonstrate 
the need for a better understanding of plant 
responses to increased climate variability if 
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regional and global models are to be improved. 
Obtaining such an understanding will require 
a multifaceted approach. For instance, differ- 
ences in the size of responses might be partly 
attributable to varying rates of biodiversity”®, 
a factor that cannot be considered on the 
basis of the relatively coarse spectral infor- 
mation contained in conventional vegetation 
indices. And although Ma and colleagues’ 
study provides much-needed insight into 
ecosystem behaviour in response to extreme 
events, we can still only speculate about the 
exact nature of the underlying ecophysiologi- 
cal mechanisms and the potential legacy of 
such events. m 
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Dpp spreading is required for medial but 
not for lateral wing disc growth 


Stefan Harmansa!, Fisun Hamaratoglu’, Markus Affolter!* & Emmanuel Caussinus! 


3% 


Drosophila Decapentaplegic (Dpp) has served as a paradigm to study morphogen-dependent growth control. However, 
the role of a Dpp gradient in tissue growth remains highly controversial. Two fundamentally different models have been 
proposed: the ‘temporal rule’ model suggests that all cells of the wing imaginal disc divide upon a 50% increase in Dpp 
signalling, whereas the ‘growth equalization model’ suggests that Dpp is only essential for proliferation control of the 
central cells. Here, to discriminate between these two models, we generated and used morphotrap, a membrane- tethered 
anti-green fluorescent protein (GFP) nanobody, which enables immobilization of enhanced (e)GFP::Dpp on the cell 
surface, thereby abolishing Dpp gradient formation. We find that in the absence of Dpp spreading, wing disc patterning 
is lost; however, lateral cells still divide at normal rates. These data are consistent with the growth equalization model, 
but do not fit a global temporal rule model in the wing imaginal disc. 


Morphogens regulate patterning and growth of tissues and organs by 
forming long-range gradients from regions of high concentration (the 
source) to regions of low concentration (the adjacent target field)'~. 
In Drosophila, the vertebrate bone morphogenetic protein (BMP)2/4 
homologue Dpp is studied extensively in the wing imaginal disc. This 
larval precursor of the fly wing and the dorsal thorax is subdivided 
into an anterior and a posterior compartment®’. Dpp is expressed 
in a stripe of anterior cells adjacent to the compartment boundary*, 
forming long-range anterior and posterior extracellular gradients in 
the target field?"'!. The Dpp gradient is transduced by its receptors 
Thickveins (Tkv)'” and Punt” and translated into an intracellular gra- 
dient of phosphorylated Mothers against dpp (p-Mad)!*. Dpp signal- 
ling suppresses transcription of brinker (brk)!5-!’, a repressor of Dpp 
target gene transcription’! and a repressor of growth’®. This results in 
high p-Mad levels (high Dpp signalling) in the medial region of the 
wing disc and high Brk levels (low Dpp signalling) in the lateral region 
of the wing disc. The interplay of p-Mad and Brk coordinates the 
expression profiles of other Dpp targets, such as spalt (sal), optomoter 
blind (omb; also known as bifid) and daughters against dpp (dad)'°~*". 
In addition to its role in patterning, Dpp is a key regulator of growth; 
overexpression of Dpp promotes wing disc overgrowth”, while dpp 
mutant wing discs remain very small”. 

To our knowledge, the requirement for Dpp spreading has never 
been explicitly tested by, for example, blocking Dpp dispersal by 
tethering it to the cell membrane, as has been done for the Wingless 
(Wg) morphogen**”’. The available experimental evidence strongly 
supports an instructive and essential role for Dpp spreading in the 
control of patterning (reviewed in refs 14, 28, 29). However, the role of 
Dpp spreading in growth control is highly controversial**3?!. Two 
major models have been suggested to explain how the Dpp gradi- 
ent controls uniform proliferation and growth of the wing disc. One 
model, the temporal rule, suggests that all cells of the wing imaginal 
disc compute the level of Dpp and divide upon a 50% increase in Dpp 
signalling. In contrast, the growth equalization model proposes that 
Dpp sustains the proliferation of medial cells by the removal of the 
growth repressor Brk, while the proliferation rate of lateral cells is 
limited by Brk to rates that can be sustained by medial cells, resulting 


in a uniform proliferation profile along the wing disc tissue***?"*4. In 


the growth equalization model, the Dpp/Brk system is not a growth 
promoter but is rather a growth-modulatory system, ironing out 
inherent regional differences in proliferation rates**. To study the 
role of Dpp spreading in wing disc patterning and growth better, 
we designed and experimentally established a novel approach to 
manipulate morphogen spreading in vivo. 


Nanobody-mediated morphogen trapping 

To manipulate the Dpp gradient in vivo, we designed and implemented 
a synthetic morphogen trapping system consisting of a GFP-tagged 
morphogen (in our case, eGFP::Dpp) and a generic extracellular 
GFP trap (VHH-GFP4::CD8::mCherry; referred to as morphotrap) 
(Fig. 1a). Our eGFP::Dpp construct is based on a previously pub- 
lished fusion protein? and was implemented as a LexA inducible 
transgene (see Methods and Extended Data Fig. 1a). Morphotrap 
(VHH-GFP4::CD8::mCherry) represents a fusion protein consisting 
of an extracellular, single-domain nanobody against GFP*° (and cog- 
nate fluorescent tags, including eGFP), followed by the mouse CD8 
transmembrane domain and a cytoplasmic mCherry fluorescent tag. 
Morphotrap was implemented as a Gal4-inducible transgene as well as 
a LexA-inducible transgene (see Methods). The principle idea behind 
morphotrap is to immobilize the extracellular fraction of eGFP::Dpp in 
Drosophila tissues in a controlled spatial manner, either in the presence 
or the absence of wild-type Dpp (Fig. 1a). 

Expression of eGFP::Dpp by the dpp-LG LexA driver line in a 
dpp**"”? mutant background restored proper Dpp signalling in the 
wing tissue such that the size and pattern was rescued to a large extent 
and adult flies developed (Extended Data Fig. 1). These results show 
that our eGFP::Dpp fusion protein acts as a good surrogate for Dpp in 
the wing disc. 

Morphotrap localizes along the basolateral and apical surface of 
wing disc cells (Fig. 1b), and does not interfere with Dpp signalling or 
cell survival when expressed at high levels (Extended Data Fig. 2a-e). 
Therefore, morphotrap can be expressed at high levels and accumu- 
lates around the expressing wing disc cells without interfering with cell 
division and patterning. 


1Growth & Development, Biozentrum, Klingelbergstrasse 50/70, University of Basel, 4056 Basel, Switzerland. @Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, 
Switzerland. *Institute of Molecular Life Sciences (IMLS), University of Zurich, 8057 Zurich, Switzerland. 
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Figure 1 | Morphotrap can block eGFP::Dpp spreading. a, The 
morphotrap system. b, Optical cross-section (left, middle) and schematics 
(right) along the apical—basal (A-B) axis of a wing disc expressing 
morphotrap in disc proper (DP) cells only (nubbin::Gal4). Morphotrap 

is localized all along the cell membrane (in red), both apical (arrow) and 
basal to the junctional marker Discs-large (Dlg; in green). peripodial 
membrane (PPM). c, Schematic representation of eGFP::Dpp (LexA/ 
LOP) and morphotrap clones (Gal4/UAS). For all discs anterior (A) is 
oriented to the left and dorsal (D) is oriented to the top. d, A wild-type 


Morphotrap can modify the Dpp gradient 

We then tested whether exposing morphotrap on the cell surface locally 
modified the extracellular concentration of eGFP::Dpp. We generated 
random small clones of morphotrap in wild-type wing discs expressing 
eGFP::Dpp in the domain of dpp transcription. To set apart the induced 
morphotrap clones from the cells expressing eGFP::Dpp, we used Gal4 
and LexA drivers to induce morphotrap and eGFP::Dpp, respectively 
(Fig. 1c; see Methods). In control discs, in which no clones were gener- 
ated, eGFP::Dpp formed a bilateral extracellular concentration gradient 
visualized by sensitive extracellular immunostainings against eGFP 
(Fig. 1d). The eGFP signal dropped below detection levels at a distance 
of approximately 60 um from the medial eGFP::Dpp source. In discs 
in which small clones expressing morphotrap had been generated, we 
detected high levels of extracellular eGFP::Dpp coating the surface of 
the clone cells, even when the clones were located in regions in which 
eGFP::Dpp was not detected otherwise (Fig. le). These results show 
that morphotrap is able to sequester extracellular eGFP::Dpp, even in 
areas of low or non-detectable eGFP::Dpp. 

Trapped eGFP::Dpp was active in signalling, since morphotrap 
clones located in the lateral region of the disc showed increased p-Mad 
levels, mainly along the edge facing the eGFP::Dpp source (Extended 
Data Fig. 2f, g). The results show that eGFP::Dpp disperses over 
the entire width of the disc, although its levels cannot normally be 
detected above background levels in the lateral regions using fluores- 
cent microscopy (see also ref. 30). We conclude that eGFP::Dpp can 
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wing disc expressing eGFP::Dpp in the Dpp stripe (dpp::LG), visualized 

by eGFP fluorescence (left) or by extracellular GFP staining (exGFP) 
(middle). Fluorescence intensity profile of the region marked by a red 
rectangle (right). a.u., arbitrary units. e, Lateral morphotrap clones trap 
extracellular eGFP::Dpp. f, Gradient formation is blocked by co-expression 
of eGFP::Dpp and morphotrap in the Dpp stripe (both expressed by 
dpp::LG). g, Co-expression of eGFP::Dpp and morphotrap in the stripe 
fully blocks Dpp spreading since additional morphotrap clones do not 
show eGFP signal (see insets in second panel from the left). 


interact with its receptors when bound to the cell surface by morpho- 
trap and that lateral cells can respond to Dpp. 

To investigate whether morphotrap was able to interfere with the 
formation of the extracellular concentration gradient of eGFP::Dpp 
when expressed in the source cells, we expressed both eGFP::Dpp 
and morphotrap in wild-type wing discs in the domain of dpp tran- 
scription. Under these conditions, we did not detect any dispersal 
of eGFP::Dpp using antibody staining (Fig. 1f), suggesting that 
eGFP::Dpp cannot leave the source region owing to tethering to 
secreting cells. Furthermore, clones expressing morphotrap in lateral 
cells did not accumulate any eGFP::Dpp on the cell surface in these 
conditions, neither did clones in the vicinity of the eGFP::Dpp source 
(Fig. 1g, middle; see insets). These results demonstrate that morpho- 
trap fully retains eGFP::Dpp on source cells and completely abol- 
ishes the formation of the extracellular concentration gradient of 
eGFP::Dpp. 


Dpp spreading is required for patterning 

The function of dpp for patterning the wing disc has been studied 
extensively''%°°”, However, how a loss of Dpp spreading would 
affect target gene expression has not been tested directly. To compare 
Dpp signalling responses in control dpp“*"? wing discs rescued by 
eGFP::Dpp (eGFP::Dpp gradient is present) to Dpp signalling 
responses in dpp**“!? wing discs expressing both eGFP::Dpp and 
morphotrap in the expression domain of dpp (eGFP::Dpp gradient 
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Figure 2 | Blocking Dpp spreading results in a sharp p-Mad/Brk 
transition. a, Representative dpp**"”? mutant wing disc rescued with 
eGFP::Dpp (rescue) and stained for exGFP (grey). b, Left, exGFP signal in 
a disc co-expressing eGFP::Dpp and morphotrap (dpp-LG, co-expression) 
in a dpp®4!? mutant. Right, magnification of the region marked by the 
rectangle on the left, showing that all signal is from anterior cells where 
Dpp is expressed. A/P boundary is determined by Ptc staining (green) 
and marked by a dotted line. Approximate domain size is marked by 
arrowheads. c, d, p-Mad staining in rescue (c) and co-expression (d) wing 
discs. e, f, Brk staining in rescue (e) and co-expression (f) wing discs. 

g-f, Average fluorescence intensity profiles of 98-100 h after egg laying 
(AEL) old larvae measured to the edge of the wing disc of rescued (g) and 
co-expression (h) wing discs. Profiles were measured with 30% dorsal 
(D) offset parallel to the dorso/ventral (D/V) boundary (see Methods for 
details). Error bars show standard deviation (s.d.). 


is absent; Methods and Fig. 2), we performed immunostainings 
against p-Mad, Brk, Sal and Omb. In control discs, p-Mad, Sal and 
Omb formed three bilateral gradients of different widths, Sal being 
the narrowest and Omb being the widest (Fig. 2c, g and Extended 
Data Fig. 3a); Brk was only detected in the most lateral regions of 
the discs (Fig. 2e, g). In contrast, when eGFP::Dpp and morpho- 
trap were co-expressed, Dpp spreading and hence gradient forma- 
tion was fully blocked throughout development (Extended Data 
Fig. 4a—c). In these discs, the p-Mad, Sal and Omb gradients col- 
lapsed in the posterior compartment to a single row of cells abut- 
ting the anterior source of eGFP::Dpp (Fig. 2d, h and Extended Data 
Fig. 3b); high levels of Brk were detected in the posterior compart- 
ment up to the source of Dpp, except for a single row of cells abutting 
the compartment boundary (Fig. 2f, h). Similar results were obtained 
regarding target gene expression in the anterior compartment upon 
trapping eGFP::Dpp in source cells (Fig. 2h and Extended Data Fig. 3). 
In addition, we inhibited eGFP::Dpp dispersal in posterior cells only 
(Extended Data Fig. 5); under these conditions p-Mad failed to form 
a posterior long-range gradient and both Sal and Omb expression col- 
lapsed onto the narrow p-Mad domain. Hence, wing disc patterning in 
the posterior compartment was abolished (Extended Data Fig. 5a-f). 
Wings of flies with blocked or reduced Dpp spreading lacked proper 
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Figure 3 | The uniform proliferation pattern is independent of Dpp 
spreading. a, Top, the temporal rule model of growth control. Bottom, 

the growth equalization model. b, p-H3 staining in a representative 
dpp48/412 mutant wing disc rescued with eGFP::Dpp. The A/P boundary 
and the pouch outline are marked by dotted lines (right). c, e, Computed 
p-H3 spots density (of n = 16 discs) in rescue (c) and co-expression (e) 
wing discs (see Methods). d, p-H3 signal in a dpp**“? mutant wing 

disc co-expressing eGFP::Dpp and morphotrap. f, Mitotic density in the 
anterior (P > 0.05) and posterior pouch (P > 0.05); whiskers correspond to 
minimum and maximum data points. 


wing vein patterning (Extended Data Figs 3f and 9d, f). Altogether, 
these results show that dispersal of Dpp is strictly required for the 
patterning function of Dpp. 


Dpp spreading and growth control 

Despite numerous studies addressing the role of Dpp in the control of 
growth of the wing imaginal disc, the conclusions drawn from different 
sets of experiments have resulted in conflicting interpretations. In the 
temporal rule model, all disc cells compute the increase in Dpp 
levels and divide upon a gain of 50%. In sharp contrast, the 
growth equalization model”* proposes that lateral cells proliferate 
independently of Dpp (Fig. 3a). In line with this later model, Dpp 
signalling has been blocked in regions outside of the wing pouch in 
several studies, without much effect on cell proliferation*?#°. However, 
it has not been possible to directly modulate the Dpp gradient at the 
protein level until now, making it difficult to interpret the requirement 
of Dpp long-range function in growth control. 

To discriminate between these two growth control models, we 
aimed at using a different experimental approach, directly eliminating 
the Dpp gradient at the protein level using morphotrap. As described 
earlier, the elimination of the gradient leads to the absence of Dpp sig- 
nalling, that is, the target genes sal and omb are not expressed in the 
wing epithelium beyond the source cells and the immediate neighbours, 
and the Brk repressor is present at high levels in all cells beyond the 
Dpp source. We thus compared the proliferation pattern of control 
dpp**"!? wing discs rescued by eGFP::Dpp to the proliferation pattern of 
dpp**’“!? wing discs expressing both eGFP::Dpp and morphotrap in the 
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Figure 4 | Block of Dpp spreading does not affect clonal proliferation 
rates. a—d, To estimate clonal proliferation rates in the posterior 
compartment, Raeppli was induced 30h before dissection (66-70 h AEL) 
and larvae were dissected at 96-100h AEL. a, Control disc. b, Disc with 
blocked Dpp spreading. c, Cell numbers per clone were counted and 
plotted against the relative position in the posterior compartment 

(0 corresponding to the A/P boundary and | to the posterior edge of the 
disc). d, Boxplots showing the number of cells per clone for all data points 
(left plot, P > 0.05) or when the 1-3 cell clones are excluded (right plot, 
P>0.05). 


expression domain of dpp, that is, we compared the growth rates of wing 
disc cells in the presence and in the absence of eGFP::Dpp spreading. 
We visualized the proliferation pattern of such wing discs by staining 
for phospho-histone H3 (p-H3), a marker for mitotic cells. In wild- 
type wing discs, cell proliferation was shown to be rather homogene- 
ous in third instar wing discs”*!*. Our quantitative analyses showed 
that in discs rescued with eGFP::Dpp, the proliferation profile was 
also uniform (Fig. 3b, c). Interestingly, blocking Dpp spreading nei- 
ther affected the uniform proliferation pattern (Fig. 3d, e) nor did we 
detect significant changes in the mitotic density in wing imaginal discs 
during the observed developmental stages (Fig. 3f and Extended Data 
Fig. 4g-i). 

To obtain a more global and more quantitative view of the cell divi- 
sion patterns, we used the whole-tissue labelling tool Raeppli* to 
induce differently marked clones in control wing discs and in wing 
discs in which Dpp spreading was blocked by morphotrap (Fig. 4 and 
Extended Data Fig. 6). To compare the proliferation rates in the pres- 
ence or absence of Dpp spreading, we induced colour selection in clones 
at different time points of development and quantitatively evaluated 
the resulting clone size after defined time points (number of cells per 
clone). In control wing discs, clonal growth rates were homogeneous 
along the anterior—posterior (A/P) axis (Fig. 4c, black dots). When 
Dpp spreading was blocked, we observed that the majority of clones 
showed similar growth rates to control clones, and we did not find 
a significant difference in clonal proliferation between controls and 
discs with blocked Dpp spreading (Fig. 4d). However, with blocked 
Dpp spreading, we also found low numbers of small clones (1-3 cells) 
next to the A/P boundary (Extended Data Fig. 7). These small clones 
were not found in control discs, in which Dpp spreading was normal. 
The presence of such small clones might hint towards the fact that a 
subpopulation of wing disc cells depend on Dpp signalling to divide 
and/or survive. 

Both the p-H3 data and the Raeppli results demonstrate that the cells 
in the lateral Brk domain do not depend on Dpp spreading to prolif- 
erate (in contradiction with the temporal rule model), but rather that 
the proliferation rate is set by a Dpp-independent system (consistent 
with the growth equalization model). 
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Figure 5 | The development of the medial but not the lateral wing 

disc requires Dpp spreading . a-k, Data set from 79-112h AEL stained 
for Brk. a-d, Representative rescued wing discs of four time points 
investigated. e, Magnification of area marked in d, visualizing the location 
of medial (high Dpp signalling) and lateral (low Dpp signalling) domain 
(see Methods). f, Temporal development of domain width in the posterior 
compartment in rescued discs. g-j, Representative co-expression wing 
discs. k, Temporal development of domain width in co-expression wing 
discs, and size change of the lateral domain relative to control discs (rescue 
n= 34, co-expression n = 37). f, k, Error bars show s.d. 


Dpp spreading and size control 

Using morphotrap in dpp mutant flies also allowed us to address how 
long-range spreading of Dpp affects wing disc size control. We quan- 
tified and compared the temporal growth profile of the posterior com- 
partment of control dpp4*/4!2 wing discs rescued by eGFP::Dpp to the 
growth profile of dpp48/4!2 wing discs co-expressing both eGFP::Dpp 
and morphotrap in the expression domain of dpp. We performed 
immunostainings against Brk at different time points between 80 and 
112h after egg laying (AEL). In control discs, the posterior compart- 
ment doubled in width during the observed time window (Fig. 5a—-d 
and Extended Data Fig. 8d). We delimited a medial low Brk (indicat- 
ing high Dpp signalling) zone and a lateral high Brk (indicating low 
Dpp signalling) zone (Fig. 5e; see Methods); both zones increased in 
width at the same speed, keeping a constant relative proportion of 1:1 
(Fig. 5f), consistent with published data**. In discs in which spread- 
ing of Dpp was abolished, the low Brk zone in the centre of the disc 
was reduced to a single medial row of cells in the posterior compart- 
ment (see earlier and Fig. 5g-j). During the observed time window, 
the lateral part of the posterior compartment showed similar widths 
and width increases as the lateral high Brk zone of the posterior com- 
partment of control discs (Fig. 5k). Similar growth profiles were seen 
in discs expressing eGFP::Dpp in the source stripe and morphotrap 
in the posterior compartment (Extended Data Fig. 5g). These results 
demonstrate that growth in the lateral region of the wing disc is inde- 
pendent of the extracellular Dpp gradient and does not depend on the 
dynamics of Dpp signalling. In support of this finding, similar growth 
dynamics were observed for the anterior compartment (Extended Data 
Fig. 8a, b). 
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In contrast, the medial, Brk-negative region is lost when Dpp 
spreading is blocked, suggesting that Dpp dispersal is important for 
growth control of the medial region, in particular in the central wing 
pouch area. We therefore quantified wing pouch size using the inner 
W¢-expression ring as a pouch marker. We measured the size of the 
pouch in dpp mutant discs rescued with eGFP::Dpp, and compared it to 
the pouch of discs in which either eGFP::Dpp dispersal was hindered in 
the posterior compartment only, or in which the release of eGFP::Dpp 
from the anterior source was completely blocked (Extended Data 
Fig. 9). Upon hindering Dpp spreading in the posterior compartment, 
the size of the posterior pouch was reduced by approximately 40%. 
Strikingly, when we trapped eGFP::Dpp in the source, the size of the 
posterior wing pouch was even further reduced (by more than 60%). 
These results indicate that eGFP::Dpp spreading is essential for wing 
pouch growth. The analyses using the whole-tissue labelling technique 
Raeppli (Extended Data Fig. 7) further showed that small clones were 
found in the posterior compartment close to the compartment bound- 
ary when morphotrap is expressed in source cells. Such clones were not 
found in control discs. Together, these data show that Dpp signalling 
has an important role in proliferation control of medial wing pouch 
cells, as indicated by earlier studies*?*”°, and further suggest that the 
range of Dpp spreading might be crucially linked to the size of the wing 
pouch region along the A/P axis. 


Discussion 

We used morphotrap, a novel approach to manipulate the extracellular 
Dpp gradient in the wing imaginal disc. Expressing morphotrap in 
small clones of lateral wing disc cells captures eGFP::Dpp in regions 
of the disc in which eGFP::Dpp cannot be detected above background 
levels. This finding demonstrates that Dpp does disperse over the entire 
wing imaginal disc, and that low Dpp levels could control cell behaviour 
even in lateral regions. However, we find that while Dpp spreading 
is strictly required for wing disc patterning, it is not essential for cell 
proliferation in the lateral region of the wing disc. These results are 
consistent with the growth equalization model but are in disagreement 
with a disc-wide temporal rule model, and suggest that lateral cells do 
not compute Dpp signalling levels to trigger cell division. It has been 
argued that Dpp-independent Dpp signalling (in addition to Dpp- 
dependent Dpp signalling) might control cell proliferation according 
to the temporal rule model". This interpretation was based on the 
observation that in genetic experiments in which Dpp signalling was 
eliminated by the concomitant genetic removal of brk and tkv (or brk 
and dpp), certain Dpp targets were active owing to the absence of the 
potent Brk repressor*!3°. However, in our experiments using morpho- 
trap, Dpp signalling was eliminated via the removal of the Dpp gradient 
and led to the absence of Dpp target gene expression and to the pres- 
ence of high levels of Brk in the entire lateral wing disc. Therefore, in 
our experimental setting, Dpp signalling was turned off in the lateral 
cells, yet these cells divided at a normal rate, as quantitatively shown 
by our experiments using Raeppli. As cell division should be abolished 
(or altered) in the absence of Dpp signalling, according to the temporal 
rule, our experiments reject a general, disc-wide temporal rule model 
for wing disc growth control. 

However, our data are entirely consistent with the proposal of 
the growth equalization model, suggesting that Dpp spreading 
results in medial removal of Brk and that this repression of brk 
represents an essential step in the formation of the wing pouch 
tissue**. Our results support the growth equalization model that the 
wing disc tissue consists of two regions with different requirements 
for Dpp signalling, namely a medial region that depends on Dpp 
signalling to grow and a lateral region that grows independent of 
Dpp. 

While the growth equalization model does not explain final organ 
size, our results suggest that the range of Dpp spreading is linked to 
the size of the wing pouch (albeit not to the entire disc). In a number 
of elegant studies, the range of Wg signalling was suggested to control 
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pouch growth via a feed-forward recruitment mechanism?”*, presum- 


ably together with Dpp. Interestingly, the replacement of the major 
endogenous Drosophila Wnt, Wg, with one that expresses a membrane- 
tethered form of the protein, showed that Wg spreading and gradient 
formation is dispensable for patterning and to some extent for growth 
of the pouch”®. In contrast, our results on Dpp strongly support the 
notion that Dpp spreading is essential for its role in pouch patterning 
and size control. Getting a better understanding of the control of wing 
pouch growth will require the combinatorial manipulation of the Dpp 
and the W¢ signalling pathways to study individual pathway outputs 
as well as their mutual interactions at different time points throughout 
larval development. Furthermore, it will be of major importance to 
study the interactions of the morphogen systems with other growth 
control systems (for example, the insulin-phosphatidylinositol-3-OH 
kinase and the Hippo pathways) to better understand the control of 
final organ size*”. The addition of the morphotrap and the Raeppli 
techniques to such analyses will help gain better insight into how mor- 
phogens control organ growth. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Fly strains. The following fly lines were used: yw!" (wild type), dpp-LG86Fb and 
LOP::mCherry-CAAX (K. Basler“’), tub > CD2,Stop > Gal4 (F. Pignioni). P{Cre}1b 
was obtained from Bloomington. hh-Gal4, dpp-Gal4, nub-Gal4, dpp® and dpp*? 
are described in FlyBase (http://www.flybase.org). 

Genotypes by figure. Figure 1b: w; nub-Gal4/UAS-morphotrap; Fig. 1d: w; 
LOP-eGEP::Dpp/+; dpp-LG/+; Fig. le: yw, hsFlp; tub >CD2,Stop > Gal4, LOP- 
eGFP::Dpp/UAS-morphotrap; dpp-LG/+; Fig. 1f: w; LOP-eGFP::Dpp/LOP- 
morphotrap; dpp-LG/+; Fig. 1g: w hsFlp; LOP-eGFP::Dpp, tub >CD2,Stop > Gal4/ 
LOP/UAS-morphotrap; dpp-LG/+. Fig. 2a, c, e: w; LOP-eGFP::Dpp, dpp"'?/dpp®; 
dpp-LG/+; Fig. 2b, d, f: w; LOP-eGFP::Dpp, dpp“!?/LOP-morphotrap, dpp®; 
dpp-LG/+; Fig. 3b, c: w; LOP-eGEP::Dpp, dpp*'?/dpp“’; dpp-LG/ +: Fig. 3d, e: w; 
LOP-eGEP::Dpp, dpp"'?/LOP-morphotrap, dpp“; dpp-LG/+; Fig. 4a: yw, hsFlp; 
LOP-eGEP::Dpp, dpp"'?/dpp"; dpp-LG, hh-Gal4/2 x LOP/UAS::Raeppli; Fig. 4b: yw, 
hsFlp; LOP-eGFP::Dpp, dpp"'?/LOP-morphotrap, dpp“*; dpp-LG, hh-Gal4/2 x LOP/ 
UAS::Raeppli; Fig. 5a—d: w; LOP-eGFP::Dpp, dpp“'?/dpp“*; dpp-LG/+; Fig. 5g-j: w; 
LOP-eGFP::Dpp, dpp'?/LOP-morphotrap, dpp“; dpp-LG/+. 

Molecular cloning. For pUASTLOTattB_eGFP::Dpp, GFP was replaced by 
eGFP in the Dpp-GFP plasmid? (obtained from S. Cohen). Then, eGFP::Dpp 
was inserted in the multiple cloning site of pPUASTLOTattB vector** by standard 
cloning procedures. 

For pUASTLOTattB_VHH-GFP4::CD8::mCherry, we inserted the 
VHH-GFP4 fragment after the signal peptide sequence of the mouse CD8 domain 
in the pUAS::CD8::GFP plasmid“’. We replaced the GFP by a mCherry (Clonetech) 
and finally cloned the VHH-GFP4::CD8::mCherry fragment into the pUASTLO- 
TattB vector®. 

Transgenes were inserted by phiC31-integrase-mediated recombination into 
the 35B region on the 2nd chromosome. Resulting fly lines are responsive to LexA 
(ref. 48) and Gal4 (ref. 50) transcriptional activators. By crossing these flies to 
Cre’-expressing flies, either the UAS or the LOP site is being excised in a mutually 
exclusive manner. Excision was screened for by PCR as described previously’. 
Creation of wing disc data sets. Flies were kept in standard fly vials (contain- 
ing polenta and yeast) in a 26°C incubator. Larvae were staged as described 
previously“*. In our data sets, we only included male larvae, which were positively 
selected for the presence of the genital disc. All male larvae of a collection were 
dissected and further processed to obtain maximum sample numbers. 

Statistics and data representation. The phenotypes observed and quantified 
(pattern and size) differ strikingly from controls; therefore no sample size esti- 
mation was performed. However, sample number was chosen to ensure statistical 
significance, which was assessed using a two-sided Student's t-test with unequal 
variance. No randomization was done, however all larvae of an experiment were 
kept in the same incubator, as well as dissected and processed together using iden- 
tical solutions in order to minimize variation between the different experimental 
groups. Blinding was not possible due to the obvious phenotypes observed. For 
quantitative measurements, the centre values represent the arithmetic mean and 
the error bars show standard deviation, expect for boxplots (Fig. 3f, Fig. 4d, and 
Fig. 5k, bottom), where centre value correspond to the median and the whiskers 
mark the maximum and minimum data points. 

Immunostainings and image acquisition. Staged larvae were dissected and 
transferred directly to cold fixative (4% PFA in PBS) and fixed for 20 min at room 
temperature or 40 min at 4°C (for p-Mad and Brk stainings) rotating. After fix- 
ation, discs were extensively washed with PBT (PBS plus 0.3% Triton-X) and 
blocked in PBTN (PBT plus 2% normal donkey serum; Jackson Immuno Research 
Laboratories) for 30 min at room temperature, followed by incubation with pri- 
mary antibody overnight at 4°C. The next day discs were washed in PBT six times 
for 20 min and incubated in secondary antibody for 1.5h at room temperature 
on a rotor. After another round of washes with PBT, samples were mounted in 
Vectashield (H-1000, Vector Laboratories). All discs of one data set were mounted 
on the same slide using larval brains as spacers. For all quantitative data sets we 
made sure that imaging conditions allowed acquisition of data in the linear range 
(Extended Data Fig. 10). For high-resolution imaging along the z-axis (Fig. 1b), 
discs were mounted with double-sided tape as spacers to avoid squeezing of the 
discs. The extracellular GFP staining was done as described previously*’. Images 
were acquired on a Leica SP5 confocal microscope (section thickness 1 um for data 
sets, 0.13 um for optical cross-section in Fig. 1b). 

BrdU labelling. Discs were dissected in Schneider’s insect medium, followed by a 
1h incubation in Schneider's plus 75 ug ml“! BrdU (Sigma, B5002) at room tem- 
perature. This was followed by two 5 min washes in Schneider’s and one 5 min 
wash in PBS. Then discs were fixed for 15 min in PBS plus 4% PFA, followed by 
another 15 min fixation in PBS plus 4% PFA plus 0.6% Triton-X-100. Discs were 
permeabilized for 60 min in PBS plus 0.3% Triton-X-100 and transferred to a 1:1 
mixture of PBS plus 0.6% Triton-X-100: 4 N HCl for 30 min. This was followed by 
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extensive washes in PBS plus 0.3% Triton-X-100. Discs were incubated overnight in 
anti-BrdU (1:100, Becton Dickson, 347580) in PBS plus 0.3% Triton-X-100. 
Washing, incubation in secondary antibody and mounting were done as described 
earlier. 

Antibodies. rb-anti-p-Mad (1:1,500; E. Laufer*?); rb-anti-phospho-Smad1/5 
(1:200; Cell Signaling, 9516S; used in Extended Data Fig. 4d-f); gp-anti-Brk 
(1:1,000; Gines Morata); rb-anti-Sal (1:40; R. Schuh*‘); rat-anti-Sal (1:700; 
R. Barrio®°); rb-anti-Omb (1:1,200; G. O. Plugfelder*); m-anti-W¢ (also known as 
4D4-s; 1:120; DSHB, University of Iowa); m-anti-Ptc (also known as Apal-s; 1:40; 
DSHB, University of Iowa); rb-anti-GFP (1:200 for extracellular staining; Abcam 
ab6556); anti-BrdU (1:100; Becton Dickson, 347580). All secondary antibodies 
from the AlexaFluor series were used at 1:750 dilutions except for Alexa405-anti-rb 
and Alexa680-anti-m, which were used at 1:500 dilutions; CF405S-anti-gp was 
used 1:1,000 (Sigma-Aldrich). 

Image processing. Images were processed using Image] (National Institutes 
of Health) software. Concentration profiles in Fig. 1, Extended Data Fig. 3 and 
Extended Data Fig. 5f, right, were created using the Plot Profile function in ImageJ. 
Optical cross-section in Fig. 1b was created using the section function in Imaris 
(Bitplain) software. We made use of the Wg/Ptc co-staining“*, which outlines the 
wing pouch (Wg), the D/V boundary (Wg) and the A/P boundary (Ptc, see also 
Extended Data Fig. 2). Quantification of wing pouch size and extraction of average 
gradient profiles (Figs 2g, h, 3f and Extended Data Figs If, 2d, e, 4c, f, i, 5e, 8f, 9g) 
were done using the Wing] software”” (http://tschaffter.ch/projects/wingj/). For 
measuring gradient profiles in Wing], we used average projections of ten con- 
secutive slices spanning the disc proper epithelium only. Gradient profiles were 
extracted using Wing] software either only in the pouch (Extended Data Fig. 2) or 
up to the edge of the wing disc (Fig. 2 and Extended Data Fig. 4), which allowed a 
better representation of lateral Brk profiles. Profiles were measured with a Sigma of 
4px and either 15% ventral offset (for Extended Data Fig. 2e) or 30% dorsal offset 
(for all other profiles) parallel to the D/V border (marked by the Wg staining). 
Plotting of average concentration profiles was done applying the Matlab toolbox 
included in Wing] using the Matlab (Matworks) software. 

Generation of mitotic density maps. Wing discs were staged and stained for 
We/Ptc and p-H3, a marker labelling mitotic cells. p-H3-positive nuclei were 
detected using the Imaris software (Bitplane) spot detection tool; peripodial 
nuclei were excluded from the following computation. Each disc was marked at 
15 landmarks (see Extended Data Fig. 8e). Sixteen discs of one time point were 
fitted to a reference disc using these landmarks by an affine transformation (least 
square, Fiji-Landmark correspondence plug-in). All data points of these 16 discs 
were included in a scatter plot using the Scatplot script (A. Sanchez-Barba; http:// 
www.mathworks.com/matlabcentral/fileexchange/8577-scatplot) in Matlab. The 
Scatplot visualizes data point density by a colour map, with high-density regions 
appearing in red and low-density regions in blue. 

The mitotic density in Fig. 3f was calculated by normalizing the number of 
p-H3-positive cells in the anterior or posterior pouch to the corresponding pouch 
area. Statistical significance was assessed using a two-sided Student’s t-test with 
unequal variance. 

Induction and computation of Raeppli clones. In our experiments we used two 
copies of nuclear Raeppli, resulting in ten different colour combinations after 
induction (see ref. 43). The larvae were staged as described earlier and dissected 
at 96-100h AEL. Raeppli was induced by heat shock (38°C for 15 min) at three 
different developmental time points: 55-59 h AEL (~41h before dissection), 
66-70h AEL (~30h before dissection) or 76-80 h AEL (~20h before dissection). 
Discs were fixed in 4% PFA in PBS for 20 min at room temperature, washed in PBT 
extensively and mounted in Vectashield (H-1000, Vector Laboratories). Images 
were acquired on a Leica SP5 confocal microscope using the settings suggested 
previously*?, Number of cells per clone was counted using the ‘multi-point tool’ in 
ImageJ software (National Institutes of Health). A two-sided Student's t-test with 
unequal variance was used to test for statistical significance. 

Measuring growth of the medial and lateral domain of the wing disc. To com- 
pare the growth dynamics of the medial (high Dpp signalling) and the lateral 
domain (low Dpp signalling), we define the position of half-maximum Brk levels 
as the boundary between these two domains. The position of half-maximum Brk 
levels was accessed by extracting Brk intensity profiles along a straight line with 
30% dorsal offset parallel to the D/V boundary (Extended Data Fig. 8f) in each 
disc individually. Subsequently single Brk profiles—separately for the anterior and 
the posterior compartment—were fit to a Hill function (see Extended Data Fig. 8f, 
graph 3) using the fitting-toolbox in Matlab. For fitting we excluded the lateral- 
most signal, which is noisy due to folds and signal form the peripodial membrane. 
The Hill function to which we fit the Brk profiles returns four parameters: the 
amplitude A, a measure for how sharp the profile drops n, a constant offset C, 
and the position of half-maximum Brk levels k (k, and kp for the anterior and the 
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posterior compartment, respectively). To access the width of the lateral domain, 
we measured the width of the full compartment L, and Lp for the anterior and 
the posterior compartment, respectively. Since ka equals the width of the anterior 
medial domain, L4 — k, equals the width of the anterior lateral region, and accord- 
ingly Lp — kp equals the width of the posterior lateral domain. Medial domain width 
in case of the posterior compartment in eGFP::Dpp morphotrap co-expressing 
wing discs was not fit to a Hill function, since in this condition only one cell row 
experiences Dpp signalling during the observed time window (equalling a width 
of 3.5m on average). 
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Extended Data Figure 1 | eGFP::Dpp can compensate for endogenous 
Dpp during wing disc development. a, Part of the protein sequence of 
the Dpp protein. The two different eGFP insertion sites”, and the two 
furin cleavage sites** located in this region are marked. Furin cleavage 
of the inactive pro-form yields the active carboxy-terminal mature 
ligand. However, potential processing at cleavage site II may result in 
uncoupling of the eGFP from the mature ligand in the construct described 
previously'®. We therefore inserted the EGFP C-terminal to the second 
furin cleavage site as was done previously’. b-d, Immunostainings 

for p-Mad and Brk in wild-type (b), dpp**“? mutant (c) and dpp?*/4? 
mutant wing discs rescued with eGFP::Dpp expressed under control 

of the dpp::LG* line (d). In the dpp“*“!? mutant wing discs expressing 
eGFP::Dpp, the p-Mad and Brk profiles are rescued to a control-like 
pattern (d, bottom). The pouch outline and the A/P boundary (assessed 
by Wg/Ptc pattern, data not shown) are marked by dotted lines. 

e, The eGFP::Dpp gradient visualized by eGFP fluorescence or by an 
immunostaining for the extracellular fraction of eGFP (bottom). 

f, Quantification of wing pouch area assessed by the inner Wg ring of 


eGFP::Dpp!& , dpp%%/472) @ 


eGFP::Dpp!S , app48/412 


Pouch area (1m?) 


d8/d12 


mutant n = 10, rescue 


98-100h old wing discs (wild type n=6, dpp 
d8/d12 


n= 10; red crosses are outliers). eGFP::Dpp expression in dpp 
mutants rescues pouch area close to wild-type size. g-i, Wing discs of 
98-100h old larvae stained for Drosophila Serum response factor (DSRF; 
also known as blistered). DSRF is expressed in the future intervein tissue 
of the wing disc. Positions of prospective wing veins 3, 4 and 5 are marked 
by arrowheads. The vein pattern is largely restored in mutant discs rescued 
by eGFP::Dpp expression (i)). j-1, Adult wings of a wild-type fly (j), 

a dpp*? mutant (k) and a dpp“*“!? mutant expressing eGFP::Dpp (1) 

(W, wing). Rescued wings have a slightly elongated shape but their sizes 

are comparable to that of control wings. However, they show some 
additional vein tissue at the anterior cross-vein and wing vein 4 is absent in 
the distal part of the wing (marked by arrowhead). We speculate that this 

is due to lower eGFP::Dpp expression in the ventral compartment, which 
also manifests itself in lower ventral p-Mad levels (see d) and less well 
defined ventral vein patterns in the dSRF staining (i, arrow). Apart from 
these drawbacks, LexA-driven eGFP::Dpp can compensate for endogenous 
Dpp during wing disc development. 
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Extended Data Figure 2 | Morphotrap expression does not affect growth 
or patterning of the wing disc. a, Wing disc expressing morphotrap 

in the posterior compartment controlled by hh-Gal4 (morphotrap""). 

The W¢/Ptc pattern is used as a coordinate system to assess pouch 

size (anterior (A) pouch, left two quadrants; posterior (P) pouch, right 

two quadrants). Gradient profiles are measured parallel to the dorso- 
ventral (D/V) boundary (for example, 15% ventral offset). b, Wild-type 
wing disc stained for p-Mad. c, Wings of a male wild-type fly and a fly 
expressing morphotrap in the posterior compartment under the control 

of hedgehog::Gal4 (morphotrap"). d, Morphotrap" wing discs show no 


morphotrap 


Anti-p-Mad 


significant change in anterior or posterior pouch size (t-test two-sided, 
unequal variance: anterior compartment P > 0.05, posterior compartment 
P> 0.05). e, Posterior expression of morphotrap does not cause obvious 
changes to the p-Mad profile. f, p-Mad pattern of a wild-type wing disc 
expressing eGFP::Dpp in the endogenous Dpp source area. g, Lateral 
morphotrap clones show elevated p-Mad signal at the clone boundary 
facing the Dpp source due to eGFP::Dpp accumulation. The region 
marked by a white rectangle is enlarged to the right. d, e, Control n= 11, 
morphotrap" n=9, error bars in e show s.d. 
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Extended Data Figure 3 | Domain width of Dpp targets dependson Dpp __ d, Rescued wing disc with blocked Dpp spreading stained for BrdU. 


spreading. a, Discs of a dpp**“? mutant rescued with eGFP::Dpp stained Also in the absence of Dpp spreading the uniform BrdU signal is not lost. 
for Dpp targets Sal and Omb. Omb shows a wider distribution than Sal. e, Expression of mCherry-CAAX under the control of the dpp::LexA 

b, dpp**“? mutant wing discs co-expressing eGFP::Dpp and morphotrap. driver line used for the rescue. mCherry-CAAX is a protein with a long 
The regions marked by a dotted rectangle are enlarged to the right of half-life that localizes to the membrane. The graph to the right shows 

the respective image. The dotted red line marks the A/P compartment intensity plot of the region marked on the left. No posterior expression 
boundary. In the absence of Dpp spreading, target domains collapse is observed; however, the protein profile is graded into the anterior 

onto a single cell row in the posterior compartment. In the anterior compartment. Analogous to morphotrap-bound eGFP::Dpp, the stable 
compartment domain borders are less sharp. We hypothesize that this is mCherry—CAAX protein forms a concentration gradient into the anterior 
due to morphotrap-bound eGFP::Dpp that is dragged into the anterior compartment due to dividing cells that are pushed further laterally into 
compartment by dividing cells (see also e). Intensity profiles of the the anterior compartment. f, Wing of a rescued fly with blocked Dpp 
enlarged regions are plotted to the right. c, Wing disc of a dpp“®/4? spreading. The hinge region, arising from the lateral wing disc region, is 
mutant rescued with eGFP::Dpp stained for the proliferation marked present and well patterned. In contrast, the wing field, arising from the 
BrdU. Uniform BrdU signal is obtained along the entire disc tissue. medial wing disc region, is strongly reduced in size and patterning is lost. 
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Extended Data Figure 4 | Time course of eGFP::Dpp spreading, and animals with blocked Dpp spreading (e) stained for p-Mad. When 
signalling and the mitotic index. a-i, Time course of extracellular Dpp spreading is blocked, the p-Mad gradient also collapses onto the 
eGFP::Dpp (exGFP), Dpp signalling (p-Mad) and p-H3 from 64-112h source region at all time points. f, Average p-Mad profiles (control/block: 
AEL of larval development. a, b, Representative discs of the six time n=50/35). g, h, Control discs (g) and discs with blocked Dpp spreading 
points examined of control animals (a) and animals with blocked Dpp (h) stained for p-H3. i, Quantification of the mitotic index (p-H3 spot 
spreading (b) stained for exGFP. The region marked by a red rectangle density). No significant differences were observed between control discs 


is enlarged below each image. eGFP::Dpp spreading is tightly blocked by (black, n = 55) and discs with blocked Dpp spreading (red, n = 43) at 
morphotrap at all time points. c, Average exGFP profiles for all time points —_ any time point (n > 0.05 for all time points, two-sided t-test, unequal 
(control in black/block in red: n = 43/29). d, e, Discs of control animals (d) variance). 
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Extended Data Figure 5 | Shortening of the Dpp gradient by posterior 
morphotrap expression. a, Scheme of morphotrap expression in the 
posterior compartment (using hh::Gal4) in dpp**/“? mutant wing discs 
rescued with eGFP::Dpp. b, Posterior morphotrap expression in the rescue 
background results in strong eGFP signal in the first three cell rows of the 
posterior compartment due to eGFP::Dpp accumulation; after three cell 
rows the eGFP fluorescence signal drops. c, p-Mad staining in a dpp**/4! 
mutant wing disc rescued with eGFP::Dpp. d, p-Mad staining in a dpp**/4! 
mutant wing disc rescued by eGFP::Dpp and expressing morphotrap in the 
posterior compartment. Note that the eGFP::Dpp accumulation (marked 
by a yellow line) directly overlaps with the observed p-Mad signal. 

e, The average p-Mad profiles show that the p-Mad gradient range directly 
depends on the range of Dpp spreading (error bars are s.d.). f, dpp“®/4!? 
mutant wing discs rescued with eGFP::Dpp expressing morphotrap in 

the posterior compartment stained for Sal and Omb (for control discs 


see Extended Data Fig. 3a). The A/P boundary is marked by a dotted 

red line and the range of the eGFP::Dpp accumulation is marked by a 
dotted yellow line. In this condition the domain width of both targets is 
strongly reduced. The Sal domain directly collapses onto the eGFP::Dpp 
accumulation domain. However, Omb, which can be activated at lower 
Dpp signalling levels, shows a slightly wider distribution. We hypothesize 
that this is again due to morphotrap-stabilized eGFP::Dpp being dragged 
into the posterior compartment (as discussed in Extended Data Fig. 3). 
Intensity profiles of the enlarged regions are plotted to the right. 

g, Representative dpp“*/“!? mutant wing discs rescued with eGFP::Dpp 
expressing morphotrap in the posterior compartment stained for Brk at 
the indicated time points (79-12h AEL). In this condition the medial 
region shows strongly reduced growth (compare to Fig. 5a-f). However, 
the growth dynamics of the lateral domain are similar to the lateral growth 
observed in control wing discs (right). 
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Extended Data Figure 6 | Clonal growth rates do not change in the 
absence of Dpp spreading. a—h, Estimation of clonal proliferation 
rates as shown in Fig. 4, inducing Raeppli at different time points: 
either 20h before dissection (a—d) or 41h before dissection (e—-h). 
Discs were dissected at 96-100h AEL. a, e, Representative control discs. 
b, f, Representative discs with blocked Dpp spreading. c, g, Clone size 
(number of cells per clone) plotted against the relative position in the 
posterior compartment (0 corresponding to the A/P boundary and 1 to 


the posterior edge of the disc). Low numbers of small clones in proximity 
to the A/P boundary are found in discs with blocked Dpp spreading 

(red dots), while these small clones are not present in control discs (black 
dots; see also Extended Data Fig. 7). d, h, Boxplots showing the number 
of cells per clone. When the small clones are excluded (right boxplots) no 
significant differences are detected in clonal proliferation between control 
discs and discs with blocked Dpp spreading (P > 0.05). 
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Extended Data Figure 7 | Small clones in discs with blocked Dpp heat shock (HS) at different time points during larval development: 20h 
spreading. a—c, Wing discs with blocked Dpp spreading carrying small (a), 30h (b) and 41h (c) before dissection. The regions marked by a white 


Raeppli clones in proximity of the A/P boundary. Raeppli was induced by rectangle in the left column are magnified to the right. 
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Extended Data Figure 8 | Temporal development and fitting procedure respectively in dp“? mutant wing discs rescued with eGFP::Dpp (black, 


of Brk data set. a, b, As can be seen in Fig. 2f, there is a gap in Brk 
expression in the lateral-most region of the posterior compartment, 
indicative of Dpp expression from another, laterally located source. 
Indeed, it has been shown that Dpp is expressed during the third instar 
larval stage in a posterior, lateral position and exerts a patterning role on 
the wing imaginal disc. However, this late Dpp expression does not affect 
the growth properties of wing disc cells*’. Despite this, the additional 
Dpp source might complicate the interpretation of our growth analyses. 
To circumvent this problem, we also measured the growth properties in 
the anterior compartment in the presence (a) and in the absence of the 
eGFP::Dpp gradient (b; high uniform levels of Brk are indeed present 

in all cells outside the source). Indeed, we found that the lateral anterior 
region still grows despite the absence of the Dpp gradient and the lack of 
Dpp signalling. c, d, Width of the anterior and posterior compartment 


n= 34) and dpp**4!? mutant wing discs co-expressing eGFP::Dpp and 
morphotrap (red, = 37, error bars show s.d.). e, The red dots mark the 
15 points used as landmarks for the affine transformation. Using affine 
transformation allowed us to overlay discs of slightly different shapes and 
sizes when generating the mitotic density maps shown in Fig. 3c, e (see 
also Methods for details). f, Computation of Brk data set shown for the 
posterior compartment: (1) The compartment width L, or Lp was defined 
as the distance from the A/P boundary to the anterior or posterior edge of 
the wing tissue, respectively. Brk profiles were measured along a straight 
line with 30%D offset. (2) Profiles were extracted using Wing] software. 
(3) The single gradients were fitted to the shown Hill function. The fitting 
procedure returns the parameter k, which corresponds to the position of 
half-maximum Brk levels and hence to the width of the medial domain. 
Therefore, the lateral domain equals L — k. 
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Extended Data Figure 9 | Impact of Dpp spreading on wing pouch 

and adult wing size. a, Dpp mutant wing disc rescued with eGFP::Dpp 
stained for Wg (outlining the wing pouch) and Ptc (marking the A/P 
boundary). In this background eGFP::Dpp spreading is not hindered 

and a normal gradient forms. The size of the posterior wing pouch is 
estimated by the area enclosed by the Wg ring and the A/P boundary 
(coloured orange) and plotted in g. b, Adult wing of a rescued fly. The 
border between the hinge region and the wing blade is marked by a dotted 
orange line; the alula is labelled with an A. (Wing is the same as shown in 
Extended Data Fig. 11.) c, Rescued wing disc expressing morphotrap in 
the posterior compartment, reducing Dpp dispersal range in the posterior 
compartment. In this condition pouch size is significantly decreased 

(see g). d, Wing of a rescued fly expressing morphotrap in the posterior 


compartment. The wing blade area is strongly decreased and patterning 
in the posterior part of the wing is lost. e, Rescued wing disc expressing 
morphotrap in the Dpp stripe, completely blocking Dpp spreading, and 
hence gradient formation. Full block of Dpp spreading results in a further 
decrease of the Wg/Ptc-encircled posterior pouch area. f, Wing of a 
rescued fly co-expressing eGFP::Dpp and morphotrap. Full block of Dpp 
spreading results in a strong reduction of wing blade area. Only a small 
amount of unpatterned wing tissue is left, while the hinge region seems 

to be patterned normally (alula is present). g, Plot of the posterior pouch 
area, as accessed by the Wg/Ptc staining shown in (a, ¢, e, right) when Dpp 
spreads normally (black), Dpp spreading is reduced (blue) or when Dpp 
spreading is fully blocked (red). With decreasing Dpp dispersal range also 
the posterior pouch area decreases (n = 22). 


© 2015 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


exGFP 
Figure 2gt+h 


p-Mad ro 
Extended Data 2e cal 


fluorescent intensities (a.u.) 
fluorescent intensities (a.u.) 


0 10 20 30 40 50 60 10 20 30 40 50 60 
[Alexa405],. vad (relative units) 


e 200 a 250 ? 
p-Mad Brk or 
Figure 2gth Figure 2gth $ 
160} Extended Data 5e 200; Figure 5 Pa 1 


Extended Data 4f Extended Data 5g 


= 
N 
Co 
= 
a 
Oo 


@ 
Oo 
= 
So 
Oo 


fluorescent intensities (a.u.) 
fluorescent intensities (a.u.) 


b 
o 
a 
o 


(0) 4 4 4 4 4 0 
0 10 20 30 40 50 60 10 20 30 40 50 60 


[Alexa405],. Mad (relative units) [CF405s],,. Gp (relative units) 


120 


exGFP 


400 | Extended Data Fig.4c 


fos) 

Oo 

‘ 
‘ 


for} 
Oo 
‘ 

‘ 


& 

Oo 

‘ 
‘ 


20 ae 


“Lak: = 


20 30 40 50 60 


fluorescent intensities (a.u.) 


[Alexa405],, cep (relative units) 


Extended Data Figure 10 | Linear range imaging conditions. a—e, Linear _ to their concentrations under the established imaging conditions. Mean 


range imaging for the quantitative data sets acquired (corresponding intensities were extracted using the Histogram function in ImageJ on the 
figure is labelled at top left in each plot). Dilutions of the secondary whole imaging field of a mean projection. The background fluorescence 
antibodies used (anti-rb-Alexa 405 (blue) and anti-gp-CF405S (green)) in was measured by imaging a slide only containing Vectashield and 
Vectashield mounting medium yield fluorescent intensities proportional subtracted from the mean values. Dotted lines indicate linear fits. 
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Novel antibody-antibiotic conjugate 
eliminates intracellular S. aureus 
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John A. Flygare?, Man Wah Tan!, Eric J. Brown! & Sanjeev Mariathasan! 


Staphylococcus aureus is considered to be an extracellular pathogen. However, survival of S. aureus within host cells 
may provide a reservoir relatively protected from antibiotics, thus enabling long-term colonization of the host and 
explaining clinical failures and relapses after antibiotic therapy. Here we confirm that intracellular reservoirs of S. aureus 
in mice comprise a virulent subset of bacteria that can establish infection even in the presence of vancomycin, and we 
introduce a novel therapeutic that effectively kills intracellular S. aureus. This antibody-antibiotic conjugate consists 
of an anti-S. aureus antibody conjugated to a highly efficacious antibiotic that is activated only after it is released in 
the proteolytic environment of the phagolysosome. The antibody-antibiotic conjugate is superior to vancomycin for 
treatment of bacteraemia and provides direct evidence that intracellular S. aureus represents an important component 


of invasive infections. 


S. aureus is the leading cause of bacterial infections in humans world- 
wide and represents a major health problem in both hospital and 
community settings'. However, S. aureus is not exclusively a patho- 
gen and commonly colonizes the anterior nares and skin of healthy 
individuals. When infection does occur, the most serious infections 
such as endocarditis, osteomyelitis, necrotizing pneumonia and sepsis 
take hold after dissemination of the bacteria into the bloodstream”. 
Over the last several decades, infection with S. aureus has become 
increasingly difficult to treat due to the emergence and rapid spread of 
methicillin-resistant S. aureus (MRSA), which is resistant to all known 
B-lactam antibiotics*. Alarmingly, reduced susceptibility to vancomycin 
and resistance to linezolid and daptomycin have already been reported 
in MRSA clinical strains*. 

Investigations going back at least 50 years have revealed that 
S. aureus is able to invade and survive inside mammalian cells, includ- 
ing the phagocytic cells that are responsible for bacterial clearance*"!'. 
S. aureus is taken up by host phagocytic cells, primarily neutrophils and 
macrophages, within minutes after intravenous infection!?. While the 
majority of the bacteria are effectively killed by these cells, incomplete 
clearance of S. aureus inside blood-borne phagocytes can allow these 
infected cells to act as “Trojan horses’ for dissemination of the bacteria 
away from the initial site of infection. Indeed, patients with normal neu- 
trophil counts may be more prone to disseminated disease than those 
with reduced neutrophil counts®'*'*. Once delivered to the tissues, 
S. aureus can invade various non-phagocytic cell types, and intracellular 
S. aureus in tissues is associated with chronic or recurrent infections 
including osteomyelitis’, recurrent rhinosinusitis'®, pulmonary infec- 
tions!” and endocarditis!®. In addition, intracellular S. aureus has been 
shown to undermine innate immune responses and induce subsequent 
destruction of neutrophils'®”°. Together, these data suggest that ablat- 
ing intracellular S. aureus is key to clinical success, but until now there 
has been no way to test this hypothesis directly. 


Intracellular MRSA are protected from antibiotics 

To confirm the hypothesis that mammalian cells provide a protective 
niche for S. aureus in the presence of antibiotic therapy, we compared 
the efficacy of three major antibiotics that are currently used as the 
standard of care for invasive MRSA infections (vancomycin, dapto- 
mycin and linezolid) against extracellular planktonic bacteria versus 
bacteria sequestered inside murine macrophages (Table 1). All three 
antibiotics failed to kill a highly virulent community-acquired MRSA 
strain, USA300, sequestered inside macrophages exposed to clinically 
achievable concentrations of the antibiotics, consistent with other stud- 
ies showing that the majority of existing antibiotics are inefficient at 
killing intracellular S. aureus both in vitro”! and in vivo~. 


Intracellular S. aureus spreads infection 

To compare directly the virulence of intracellular bacteria versus 
free-living planktonic bacteria and to determine whether the intracel- 
lular bacteria are able to establish infection in the presence of vancomy- 
cin in vivo, mice were infected with equivalent doses of S. aureus taken 
directly from broth culture or bacteria sequestered inside host peritoneal 


Table 1 | Antibiotic minimum inhibitory concentrations for MRSA 


Antibiotics Extracellular MRSA Intracellular MRSA Serum Ciax 
(MIC (ug mi-4)) (MIC (ug mi~)) (ug mi~?) 
Vancomycin 1 >100 50 
Daptomycin 4 >100 60 
Linezolid 0.3 >20 20 
Rifampicin 0.004 50 20 


Extracellular MIC is the minimum antibiotic dose that prevented growth of MRSA seeded in 
trypticase soy broth. Intracellular MIC is the minimum dose that prevented survival of MRSA 
sequestered inside murine macrophages (> indicates continued growth at the highest antibiotic 
dose tested). Data are summarized from three different experiments. Serum Cay, the expected 
serum concentrations for clinically relevant antibiotics?’. 
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Figure 1 | Intracellular MRSA are protected from vancomycin. 

a, Experimental design for generating planktonic versus intracellular 
bacteria for infection and treatment with vancomycin (vanco). 

b, Bacterial loads in kidney, 4 days after infection. c.f.u., colony-forming 
units. Each point represents data from a single mouse (n =5 mice per 
group). ¢, Planktonic or intracellular bacteria were suspended in media 
containing vancomycin and cultured alone or on a monolayer of MG63, 
and surviving bacteria were enumerated 1 day later. Error bars represent 
means + standard deviation (s.d.). d, Survival of MRSA cultured with 
vancomycin with or without MG63 cells. Error bars show s.d. ND, none 
detected. Dashed lines indicate the limit of detection. Representative of 
three independent experiments. 


macrophages and neutrophils (Fig. 1a). Mice infected with intracellu- 
lar bacteria had equivalent or higher bacterial burdens in the kidneys 
4 days after infection compared with those infected with planktonic 
bacteria (Fig. 1b). Infection with intracellular bacteria also resulted in 
more consistent colonization of the brain (Extended Data Fig. 1). To 
characterize this observation further, we infected MG63 osteoblasts 
with either planktonic MRSA or an equivalent number of intracellular 
MRSA, in the presence of vancomycin. Planktonic bacteria exposed 
to vancomycin alone were efficiently killed, (Fig. 1c). However, we 
recovered a small number of surviving bacteria (approximately 0.06% 
of input) associated with the MG63 cells 1 day after infection, which 
had been protected from vancomycin by invasion into osteoblasts. 
MRSA that were sequestered inside peritoneal cells showed a sig- 
nificant increase in both survival and efficiency of MG63 infection 
in the presence of vancomycin. About 15% of intracellular MRSA in 
the leukocytes survived under vancomycin exposure identical to that 
which sterilized cultures of planktonic bacteria. Intracellular S. aureus 
that were sequestered in MG63 cells (Fig. 1d), primary human brain 
endothelial cells and A549 bronchial epithelial cells (Extended Data 
Fig. 2) were also able to increase by almost tenfold over a 24h period 
under constant exposure to a concentration of vancomycin that killed 
free-living bacteria. Together, these data suggest that intracellular reser- 
voirs of MRSA in myeloid cells can promote dissemination of infection 
to new sites in the presence of active antibiotic treatment, and that 
intracellular growth can occur in a variety of cell types despite constant 
antibiotic therapy. 


Designing the antibody- antibiotic conjugate 
These findings suggested that therapies aimed at eliminating intracel- 


lular bacteria may improve clinical success. To test this conclusively, 
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we developed a novel therapeutic that is activated specifically inside 
mammalian cells that have taken up S. aureus. The antibody-antibiotic 
conjugate (AAC) consists of an anti-S. aureus antibody (THIOMAB, 
engineered to contain unpaired cysteines) covalently linked via the 
introduced cysteines to a highly efficacious antibiotic using a cath- 
epsin-cleavable linker containing a novel quaternary ammonium salt 
(Fig. 2a). The resulting AAC has no direct antibacterial activity when 
bound to planktonic S. aureus and does not diffuse into mamma- 
lian cells, owing to the size of the antibody. However, when AAC- 
opsonized bacteria are taken up by host cells, intracellular proteases 
cleave the linker and readily release the antibiotic in its active form™ 
(Fig. 2b). Depending on the antibody specificity, we found that thou- 
sands of AACs can bind to a single bacterium and deliver a sufficient 
concentration of free antibiotic within the phagosome to result in 
bacterial killing. 

The antibody and antibiotic components were carefully chosen 
and optimized for maximal efficacy. A panel of more than 40 anti- 
S. aureus antibodies were cloned and purified from B cells derived 
from the peripheral blood of patients recovering from various S. aureus 
infections and screened for binding to a panel of clinically relevant 
S. aureus strains and to USA300 isolated directly ex vivo from the kid- 
neys of infected mice. We found the greatest extent of binding with 
antibodies directed against wall-teichoic acids (WTAs), pathogen- 
specific polyanionic glycopolymers that are connected to the thick pep- 
tidoglycan layers of Gram-positive bacteria (Fig. 2c). S. aureus produces 
WTAs composed of phospho-ribitol repeating units that are further 
modified by either a- or B-O-linked N-acetylglucosamine (GlcNAc) 
sugars mediated by TarM or TarS glycosyltransferases, respectively**. 
A human immunoglobulin G; (IgG)) that recognizes B-O-linked 
GlcNAc sugar modifications on WTA bound to all S. aureus strains 
tested. Monoclonal antibodies recognizing the a-O-linked GlcNAc 
bound well to S. aureus strains cultured in vitro; however, expression 
of the a-O-linked GlcNAc was absent on some S. aureus isolates. 
On those strains that co-expressed the a- and B-O-linked GlcNac, 
B-specific antibodies yielded consistently higher binding to in vivo- 
derived MRSA (Fig. 2c). Antigen specificity of the antibodies was con- 
firmed by genetic means, such that antibodies against a- or B-GlcNAc 
sugar modifications on WTAs failed to bind to S. aureus strains lacking 
the TarM or TarS glycosyltranferase, respectively (Fig. 2d). We esti- 
mate that ~15,000 B-WTA-specific antibodies can bind to protein-A- 
deficient S. aureus USA300 by measuring binding of radiolabelled 
antibodies (data not shown). 

To characterize antibody binding to WTA at the molecular level 
further, we co-crystallized the Fab fragment from an anti-B- WTA 
antibody in complex with a synthetic form of the minimal repeating 
B-WTA unit (C2 carbon of ribitol-1-phosphate linked to the C1 posi- 
tion in GlcNAc via a B-glycosidic bond) and determined a crystal struc- 
ture of the complex at 1.7 A resolution (Extended Data Table 1). The 
structure reveals how the minimal B-WTA epitope binds to the com- 
plementarity determining region (CDR) of the Fab antibody fragment 
(Fig. 2e). The heavy chain H1 and H3 CDR loops form a binding site for 
the GlcNAc, with the GlcNAc hexose ring stacking against the Trp 33 
indole side chain. Importantly, the antibody has an extended L1 loop 
containing two arginine residues (Arg 27d and 28) that interact with 
the phosphate group on the ribitol backbone. This arginine ‘tweezers’ 
motif triangulates the ribitol phosphodiester backbone in relation to the 
GlcNAc moiety, and is probably a key reason for the B-anomer-specific 
recognition of GlcCNAc-WTA by the antibody. 

An extensive set of criteria was used to choose the optimal antibiotic 
for the AAC platform. We chose the rifamycin class of antibiotics for 
their high potency, unaltered bactericidal activity in low phagolysoso- 
mal pH, and their ability to withstand intracellular insults. A rifamycin 
derivative (a rifalogue) that allowed connection to the linker antibody 
through a tertiary amine (TT. Pillow, manuscript in preparation) had 
optimum physicochemical properties to allow for efficient bacte- 
rial killing. The chemical groups known to make direct contact with 
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Figure 2 | AAC design. a, Model of AAC (not drawn to scale). 

b, Mechanism of AAC action. c, Binding of Alexa-488 anti-B-GlcNAC 
WTA monoclonal antibody (mAb) or anti-a-GlcNAC WTA 
monoclonal antibody, or isotype control antibody, anti-cytomegalovirus 
glycoprotein-D (gD) to USA300 isolated from infected kidneys (n= 3). 
MFI, mean fluorescence intensity. d, Binding of anti-GlcCNAC WTA 
antibodies (red) or isotype control (grey) to protein-A-deficient USA300 
lacking tarM or tarS (n= 3). WT, wild type. e, Crystal structure of 
anti-B-GlcNAc WTA Fab bound to a synthetic minimal B-WTA unit. 


bacterial RNA polymerase are conserved between rifampicin and the 
rifalogue*®. Consistent with this, the minimum inhibitory concentra- 
tion (MIC) for the rifalogue is identical to rifampicin (4 x 10~° M), as 
measured in a conventional growth inhibition assay using planktonic 
bacteria (Fig. 2f). The rifalogue, but not rifampicin, resulted in a more 
than 1,000-fold decrease in the number of viable, but non-replicating, 
bacteria after overnight incubation in minimal PBS buffer (Fig. 2g). The 
rifalogue was also able to kill classically defined persister cells, bacteria 
that presumably enter a dormant state to survive antibiotic treatment 
(for example, ciprofloxacin) of growing cultures”® (Fig. 2h), while 
treatment with rifampicin had no effect on recovery of persister cells, 
in agreement with previous observations”*. Additionally, rifalogue, 
but not rifampicin, was able to kill non-dividing MRSA sequestered 
inside murine macrophages, in part due to its capacity to accumulate 
rapidly in mammalian cells (Extended Data Fig. 3). As predicted, the 
rifalogue had no ability to kill extracellular bacteria when linked to the 
anti-8-WTA monoclonal antibody in the AAC format, but was fully 
bactericidal after the AAC was treated with cathepsin B to release the 
active antibiotic (Fig. 2i). 

To confirm that the linker used in the AAC is cleaved after internali- 
zation of the AAC inside cells, we examined cleavage of the linker with 
a fluorescence resonance energy transfer (FRET)-based probe consist- 
ing of the same anti- MRSA antibody conjugated to two dye molecules 
that were separated by the same linker used in the AAC”’. When the 
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Antibody light chain (pink) and heavy chain (blue) are shown. f, MIC 
determination for rifampicin and rifalogue on USA300 (n=5). g, Survival 
of stationary phase USA300 incubated with 1 x 10~° M rifampicin or 
rifalogue (n =4). h, USA300 bacteria were incubated without antibiotic 
(black) or with 3 ug ml“! ciprofloxacin (Cipro; green, red and grey). 

1 wg ml? of rifalogue (red) or rifampicin (grey) was added as indicated 
(n= 3). i, Intact AAC does not kill planktonic bacteria but does after pre- 
treatment with cathepsin-B (n= 3). g-i, Error bars show s.d. for triplicate 
samples (n= biological repeats). 


fluorescent molecules are covalently associated, fluorescence at 488 nm 
is quenched; upon cleavage of the linker, quenching is lost. Video 
microscopy demonstrated that the linker was cleaved within minutes 
of uptake of the MRSA opsonized with the FRET conjugate (Fig. 3a). 
We confirmed by mass spectrometry analysis that free rifalogue was 
released inside macrophages after uptake of MRSA opsonized with 
active AACs, but not when opsonized with a non-cleavable version 
of the AAC, which was prepared by replacement of the natural amino 
acid L-citrulline with p-citrulline at the P1 position, making the linker 
uncleavable by cathepsins (Fig. 3b). 


AACs eradicate intracellular S. aureus infections 

When the AAC was used to opsonize MRSA, it readily killed bacteria 
inside the macrophages, whereas the same bacteria opsonized with 
non-cleavable B- WTA AAC or monoclonal antibody alone survived 
(Fig. 3c). AAC-opsonized MRSA were killed inside every cell type 
tested, including human macrophages, endothelial and epithelial cell 
lines (Fig. 3d). These data demonstrate that the AAC releases active 
antibiotic and kills S. aureus only after internalization inside host 
cells, providing a unique tool to assess the importance of intracellular 
S. aureus in various settings. To determine whether specifically tar- 
geting intracellular S. aureus could prevent cell-to-cell transfer of bac- 
teria, infected peritoneal cells were incubated with MG63 osteoblast 
cells in the presence of vancomycin (Fig. 3e). Addition of AAC to the 
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Figure 3 | AAC linker is cleaved after internalization of bacteria. 

a, Live cell imaging monitoring cleavage of AAC linker in macrophages 
with FRET-based antibody conjugate (representative of three fields). 
TAMRA, tetramethylrhodamine. b, Mass spectrometric quantification 
of released antibiotic inside macrophages from AAC made with standard 
(cleavable) or non-cleavable linker. mAb, monoclonal antibody. 

c, d, USA300, opsonized with antibody, AAC, or non-cleavable AAC, 


culture media resulted in a significant reduction in the number of viable 
intracellular bacteria recovered 1 day after infection. Similar treatment 
with the unconjugated anti- MRSA antibody plus a tenfold higher molar 
concentration of rifampicin, purported to be one of the best-known 
antibiotics for treatment of intracellular bacteria’’, had very marginal 
efficacy. To confirm that targeting intracellular S. aureus could also 
prevent transfer of infection from intracellular reservoirs in vivo, mice 
were infected by intravenous injection with equivalent doses of plank- 
tonic or intracellular MRSA and then treated with PBS or vancomycin, 
with or without a single dose of AAC. In the presence of vancomycin 
only intracellular bacteria were able to efficiently colonize the brain 
(Extended Data Fig. 1); however, treatment with a single dose of AAC 
effectively eliminated the bacteria that had escaped vancomycin treat- 
ment (Extended Data Fig. 4) 


AAC is superior to vancomycin in vivo 

To determine the role of intracellular infection in bacteraemia, AACs 
were tested in an intravenous infection model. First, we tested the 
efficacy of unconjugated anti-MRSA antibodies by treating mice 
with either anti-B-WTA antibodies, anti-a-WTA antibodies or intra- 
venous immunoglobulin (IGIV), a pooled immunoglobulin prepa- 
ration from ~10,000 humans, 1h before intravenous infection with 
wild-type USA300 MRSA. S. aureus is acommon colonizer of human 
skin and mucosal surfaces and preliminary analysis of multiple sources 
of human serum, including IGIV, demonstrated that human serum 
contains approximately 300 ug ml! of anti-S. aureus antibodies, of 
which ~70% are directed towards the GlcNAc modifications of WTA 
(Extended Data Fig. 5). Even in this prophylactic treatment setting, 
unconjugated antibodies did not have any efficacy in preventing infec- 
tion, whereas treatment with vancomycin eliminated detectable bac- 
teria (Fig. 4a). However, the efficacy of vancomycin was limited when 
treatment was initiated several hours after infection was established 
(Fig. 4b). In at least one model of bacteraemia, 95% of the bacteria 
in blood are found to be associated with neutrophils within 15 min 
(ref. 6), suggesting that failure of vancomycin in this model could be 
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enumerated 48 h after incubation with indicated cells lines. HBMECs, 
human brain microvascular endothelial cells. Horizontal line indicates 
the limit of detection. e, Intracellular USA300 as in Fig. 1a were added to 
a monolayer of MG63 with antibody, AAC, or a mixture of antibody plus 
rifampicin in media containing vancomycin (vanco). Surviving bacteria 
were enumerated 24h later. b-e, Error bars represent means + s.d. from 
triplicate wells. Data are representative of 2-4 biological replicates. 


due to antibiotic escape through survival inside host cells. To test this 
directly, mice were treated with a single dose of AAC 24h after infec- 
tion. A single dose of AAC was efficacious and proved superior to twice 
daily vancomycin treatment initiated at the same time (Fig. 4c and 
Extended Data Fig. 6a). 

Mouse serum has no appreciable anti-S. aureus antibody. To deter- 
mine whether endogenous anti- WTA antibodies found in normal 
human serum might compete for binding with the AAC, SCID mice 
were reconstituted with physiological levels of human IgG by daily 
treatment with IGIV. The resulting SCID-hulgG mice had sustained 
levels of at least 10 mg ml~' of human IgG in the serum and were 
equally susceptible to infection with MRSA compared to untreated 
controls (data not shown). Despite the presence of potentially compet- 
ing antibodies, a single dose of anti- MRSA AAC administered at 24h 
after infection with MRSA was still more efficacious than vancomycin 
(Fig. 4d). An AAC made with an irrelevant antibody (anti-cytomegalo- 
virus glycoprotein D antibody)—which could still bind to protein A 
on MRSA—had much less efficacy. AACs made with anti-B-GlcNAc 
WTA antibodies appeared to be more efficacious than those made with 
anti-a-GlcNAc WTA antibodies (Extended Data Fig. 6b), suggesting 
that the extent of antibody binding determines antibiotic delivery. 
Treatment with the AAC was more effective than treatment with an 
equivalent dose or repeated dosing of unconjugated rifalogue, and 
release of the antibiotic from the AAC was essential for efficacy, as 
an AAC generated with the non-cleavable linker was not efficacious 
in vivo (Fig. 4e). 


Discussion 

Treatment of patients with invasive MRSA infections with conven- 
tional antibiotics results in failure rates of up to 50% (refs 29-32), 
in most cases without measurable outgrowth of antibiotic-resistant 
strains. Survival of antibiotic-susceptible subpopulations of S. aureus 
in the presence of antibiotic treatment is well documented and is 
probably due to multiple factors, including survival of persister bac- 
teria that are not actively dividing and poor antibiotic exposure of 
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Figure 4 | AAC is a more effective treatment than vancomycin after 
intravenous infection. a, Wild-type (WT) mice (m= 8 per group) were 
treated with 50 mg kg“! of the indicated anti-MRSA antibodies 1 h before 
MRSA infection or twice daily with 110 mg kg”! vancomycin (Vanco). 

b, Treatment of wild-type mice (n =5 per group) with 110 mg kg“! 
vancomycin (twice daily) was initiated either at 1h, 7h or 24h after 
infection. c, Wild-type mice (n=5 per group) were treated with 

saline, anti-B-WTA antibody used in the AAC (monoclonal antibody 
(mAb)), vancomycin (twice daily), or anti- MRSA AAC (a single dose of 
50 mg kg!) starting 24h after infection. d, e, SCID mice (d, 5 mice per 


bacteria in protected niches**-*>. However, it has been difficult to 
determine conclusively what role intracellular S. aureus plays in dis- 
ease pathogenesis and antibiotic failure. To address directly the extent 
to which this ability to survive and replicate inside cells contributes 
to the failure of conventional antibiotics, we developed a novel ther- 
apeutic in the form of an AAC that is specifically designed to be 
activated only within host cells. We found that a single dose of AAC 
treatment is effective in a murine model of bacteraemia, in which 
vancomycin, the current standard of care for MRSA infection, failed 
(Fig. 4). This suggests that ablation of the intracellular pool of path- 
ogens is key for clinical success in the treatment of potential fatal 
MRSA infection. 

In designing the AAC to test this hypothesis, we used an antibiotic 
that kills intracellular bacteria. This rifalogue antibiotic also ablates 
non-replicating bacteria, and antibiotic-resistant persister cells. We 
posit that this characteristic, along with its ability to accumulate 
within the intracellular milieu, is essential for AAC success. When 
the closely related rifampicin, which does not have these character- 
istics, was conjugated to the same anti- WTA antibody it did not kill 
intracellular bacteria and exhibited poor efficacy in vivo (Extended 
Data Fig. 8). These studies show that arming antibodies with unique 
bactericidal antibiotics can result in promising new therapies. Many 
potent antibiotic-like compounds with the desired characteristics to 
kill difficult-to-treat intracellular pathogens fail in clinical practice 
owing to poor pharmacokinetic properties or undesired host toxic- 
ity. The use of an AAC might be able to overcome these problems, 
making it a novel platform for delivering potent antibacterial com- 
pounds that may not have a suitable profile as unconjugated drugs. 
Antibiotic failure has been associated with the ability of a variety of 
bacterial pathogens to survive within host phagocytic cells*®. The 


group; e, 10 mice per group) were injected with human IgG to achieve 

a concentration of 10 mg ml~!, then infected with MRSA. Treatment as 
indicated was begun 24h after infection. Vancomycin, 110 mg kg”! twice 
daily; AAC, 50 mg kg~ a single dose on day 1; rifalogue, 0.5 mg kg"! 
either once on day 1, or once daily (days 1-3). a-e, Bacteria in kidneys 
were determined 4 days after infection. Schematic of each experimental 
design is shown in Extended Data Fig. 7. Each point shows data from a 
single animal. Bars show geometric mean. Dashed lines indicate limit of 
detection. Mann-Whitney U-test. *P < 0.05; **P < 0.01; ***P < 0.005. 


extent to which other common human pathogens beyond S. aureus 
depend on this intracellular protected niche merits investigation; the 
AAC platform promises to enhance antibiotic efficacy against these 
infectious diseases. 

Online Content Methods, along with any additional Extended Data display items and 


Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Ethics statement. All animal procedures were conducted under a protocol 
(#08-1990) approved by the Genentech Institutional Animal Care and Use 
Committee in an Association for Assessment and Accreditation of Laboratory 
Animal Care (AAALAC)-accredited facility in accordance with the Guide for the 
Care and Use of Laboratory Animals and applicable laws and regulations. For 
cloning of antibodies from human B cells, informed written consent was obtained 
from all donors and was provided in accordance with the Declaration of Helsinki. 
Approval was obtained from the health research ethics committee of Denmark 
through the regional committee for the Capital Region of Denmark. 

Bacterial strains. All in vivo experiments were done with MRSA-USA300 NRS384 
obtained from NARSA (https://www.beiresources.org) unless noted otherwise. 
The generation of protein-A-deficient strain Amcr USA300 NRS384, as well as 
protein-A-deficient USA300 lacking tarM or tarS has been described pre- 
viously**”’. The protein-A-deficient strains were used only in some in vitro experi- 
ments to determine antibody specificity. 

MIC determinations for extracellular bacteria. The MIC for extracellular 
bacteria was determined by preparing serial twofold dilutions of the antibiotic 
in tryptic soy broth. Dilutions of the antibiotic were made in quadruplicate in 
96-well culture dishes. MRSA (NRS384 strain of USA300) was taken from an 
exponentially growing culture and diluted to 1 x 10*c.f.u. ml~!. The bacteria 
were cultured in the presence of antibiotic for 18-24 h with shaking at 37°C and 
bacterial growth was determined by reading the optical density (OD) at 630 nm. 
The MIC was determined to be the dose of antibiotic that inhibited bacterial 
growth by >90%. 

MIC determinations for intracellular bacteria. Intracellular MIC was determined 
on bacteria that were sequestered inside mouse peritoneal macrophages (see later 
for generation of murine peritoneal macrophages). Macrophages were plated at a 
density of 4 x 10° cells ml“! and infected with MRSA at a ratio of 10-20 bacteria 
per macrophage. Macrophage cultures were maintained in growth media sup- 
plemented with 50 ug ml! of gentamycin to inhibit the growth of extracellular 
bacteria and test antibiotics were added to the growth media 1 day after infection. 
The survival of intracellular bacteria was assessed 24h after addition of the antibi- 
otics. Macrophages were lysed with Hanks buffered saline solution supplemented 
with 0.1% bovine serum albumin (BSA) and 0.1% Triton-X, and serial dilutions 
of the lysate were made in PBS solution containing 0.05% Tween-20. The number 
of surviving intracellular bacteria was determined by plating on tryptic soy agar 
plates with 5% defibrinated sheep blood. 

In vivo transfer of infection model. USA300 stocks were prepared for infection 
from actively growing cultures in tryptic soy broth. Bacteria were washed three 
times in PBS and aliquots were frozen at —80 °C in PBS 25% glycerol. 
Intracellular bacteria infections. Seven-week-old female A/J mice (Stock 000646) 
were obtained from Jackson Labs and infected by peritoneal injection with 
5 x 10’ c.f.u. of USA300. Mice were killed 1 day after infection and the perito- 
neum was flushed with 5 ml of cold PBS. Peritoneal washes were centrifuged for 
5 min at 1,000 p.m. at 4°C in a table-top centrifuge. The cell pellet containing 
peritoneal cells was collected and cells were treated with 50 ug ml! of lysostaphin 
(Cell Sciences, CRL 309C) for 20 min at 37°C to kill contaminating extracellular 
bacteria. Peritoneal cells were washed three times in ice-cold PBS to remove the 
lysostaphin. Peritoneal cells from donor mice were pooled, and recipient mice 
were injected with cells derived from five donors per each recipient by intrave- 
nous injection into the tail vein. To determine the number of live intracellular 
colony-forming units, a sample of the peritoneal cells were lysed in HB (Hanks 
balanced salt solution supplemented with 10mM HEPES and 0.1% BSA) with 0.1% 
Triton-X, and serial dilutions of the lysate were made in PBS with 0.05% Tween-20. 
Free bacteria infections. A/J mice were infected with various doses of free bacteria 
using a fresh aliquot of the glycerol stocks used for the peritoneal injections. Actual 
infection doses were confirmed by c.f.u. plating. For the data shown in Fig. 1a the 
actual infection dose for intracellular bacteria was 1.8 x 10°c.f.u. per mouse, and 
the actual infection dose for free bacteria was 2.9 x 10° c.f.u. per mouse. Selected 
mice were treated with a single dose of 110mg kg ' of vancomycin by intravenous 
injection immediately after infection. 

In vitro transfer of infection to non-phagocytic cells. Generation of MRSA- 
infected peritoneal cells. Six-to-eight-week-old female A/J mice (see earlier) were 
infected with 1 x 10%c.f.u. of the NRS384 strain of USA300 by peritoneal injec- 
tion. The peritoneal wash was harvested 1 day after infection, and the infected 
peritoneal cells were treated with 50g ml”! of lysostaphin diluted in HEPES 
buffer supplemented with 0.1% BSA (HB buffer) for 20 min at 37 °C. Peritoneal 
cells were then washed twice in ice-cold HB buffer. The peritoneal cells were 
diluted to 1 x 10° cells ml~! in RPMI 1640 tissue culture media supplemented 
with 10 mM HEPES and 10% fetal calf serum, and 5 ug ml! vancomycin. 
Free MRSA from the primary infection was stored overnight at 4°C in PBS 
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solution as a control for extracellular bacteria that were not subject to neutrophil 
killing. 

Infection of osteoblasts, HBMEC and A549 cells. MG63 cell line (CRL-1427) and 
A549 cells (CCL185) were obtained from ATCC and maintained in RPMI 1640 
tissue culture media supplemented with 10 mM HEPES and 10% fetal calf serum 
(RPMI-10). HBMEC cells (catalogue #1000) and ECM media (catalogue #1001) 
were obtained from ScienceCell Research Labs. The cells were used without fur- 
ther authentication or testing for mycoplasma contamination. Cells were plated in 
24-well tissue culture plates and cultured to obtain a confluent layer. On the day 
of the experiment, the cells were washed once in RPMI (without supplements). 
MRSA or infected peritoneal cells were diluted in complete RPMI-10 and van- 
comycin was added at 5 ug ml“! immediately before infection. Peritoneal cells 
were added to the osteoblasts at 1 x 10° peritoneal cells per ml. A sample of the 
cells was lysed with 0.1% Triton-X to determine the actual concentration of live 
intracellular bacteria at the time of infection. The actual titre for all infections was 
determined by plating serial dilutions of the bacteria on tryptic soy agar with 5% 
defibrinated sheep blood. 

Generation of the anti-S. aureus antibodies. The human IgG antibodies against 
anti-B-GlcNAc WTA monoclonal antibody (mAb) and anti-a-GlceNAc WTA mAb 
were cloned from peripheral B cells from patients after S. aureus infection using 
a monoclonal antibody discovery technology that conserves the cognate pairing 
of antibody heavy and light chains*’. Antibodies were expressed by transfection 
of mammalian cells*!. Supernatants containing full-length IgG1 antibodies were 
harvested after 7 days and used to screen for antigen binding by enzyme-linked 
immunosorbent assay (ELISA). These antibodies were positive for binding to 
cell wall preparations from USA300. Antibodies were subsequently produced in 
200-ml transient transfections and purified with protein A chromatography 
(MabSelect SuRe, GE Life Sciences) for further testing. 

Synthesis of the linker drug. Synthesis of the rifalogue linker drug was per- 
formed as follows. Protease cleavable linker MC-VC-PAB-OH?? (1.009 g, 
1.762 mmol, 1.000, 1,009 mg) was taken up in N,N-dimethylformamide 
(6 ml, 77 mmol, 44, 5,700 mg). To this was added a solution of thionyl chlo- 
ride (1.1 equiv., 1.938 mmol, 1.100, 231 mg) in dichloromethane (DCM) (1 ml, 
15.44 mmol, 8.765, 1,325 mg) in portions dropwise (half was added over 1h, 
stirred for 1h at room temperature, then the other half was added over another 
hour). The solution remained a yellow colour. Another 0.6 equiv. of thionyl 
chloride was added as a solution in 0.5 ml DCM dropwise, carefully. The reaction 
remained yellow and was stirred sealed overnight at room temperature. The 
reaction was monitored by liquid chromatography mass spectrometry (LC/MS), 
indicating 88% conversion to benzyl chloride. Another 0.22 equiv. of thionyl 
chloride was added dropwise as a solution in 0.3 ml DCM. When the reaction 
approached 92% benzyl chloride, the reaction was bubbled with N>. The con- 
centration was increased from 0.3 M to 0.6 M. 

MC-VC-PAB-Cl (0.9 mmol) was cooled to 0°C and rifalogue (dimethyl piper- 
azinebenzoxazinorifamycin” (0.75 g, 0.81 mmol, 0.46, 750 mg)) was added. The 
mixture was diluted with another 1.5 ml of DMF to reach 0.3 M. Stirred open to 
air for 30 min. N,N-diisopropylethylamine (3.5 mmol, 3.5 mmol, 2.0, 460 mg) was 
added and the reaction stirred overnight open to air. Over the course of 4 days, 
four additions of 0.2 equiv. N,N-diisopropylethylamine base were added while the 
reaction stirred open to air, until the reaction appeared to stop progressing. The 
reaction was diluted with DMF and purified on high-performance liquid chroma- 
tography (HPLC; 20-60% ACN/FA-H,0) in several batches to give MC-VC-PAB- 
rifalogue (0.38 g, 32% yield) m/z= 1,482.8. 

The non-cleavable rifalogue linker drug was synthesized using the exact same 
method, but replacing MC-VC-PAB-OH with MC-V-D-Cit-PAB-OH. 
Conjugation of the linker drug to antibody. Construction and production of 
the THIOMAB variant of anti- WTA antibody was done as reported previously”. 
Briefly, a cysteine residue was engineered at the Val 205 position of the anti-WTA 
light chain to produce its THIOMAB variant. The thio anti- WTA was conju- 
gated to MC-vc-PAB-rifalogue. The antibody was reduced in the presence of 
50-fold molar excess dithiothreitol (DTT) overnight. The reducing agent and 
the cysteine and glutathione blocks were purified away using HiTrap SP-HP col- 
umn (GE Healthcare). The antibody was re-oxidized in the presence of 15-fold 
molar excess dehydroascorbic acid (MP Biomedical) for 2.5h. The formation of 
interchain disulfide bonds was monitored by LC/MS. A threefold molar excess 
of the linker drug (MC-VC-PAB-rifalogue) over protein was incubated with the 
THIOMAB for 1h. The AAC was purified by filtration through a 0.2 um SFCA 
filter (Millipore). Excess-free linker drug was removed by filtration. The conjugate 
was buffer exchanged into 20 mM histidine acetate pH 5.5/240 mM sucrose by 
dialysis. The number of conjugated MC-VC-PAB-rifalogue molecules per mAb 
was quantified by LC/MS analysis. Purity was also assessed by size-exclusion 
chromatography. 


© 2015 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


Mass spectrometric analysis. LC/MS analysis was performed on a 6530 
Accurate-Mass Quadrupole Time-of-Flight (Q-TOF) LC/MS (Agilent 
Technologies). Samples were chromatographed on a PRLP-S column, 1,000 A, 
8um (50mm x 2.1mm, Agilent Technologies) heated to 80°C. A linear gradi- 
ent from 30-60% B in 4.3 min (solvent A, 0.05% TFA in water; solvent B, 0.04% 
TFA in acetonitrile) was used and the eluent was directly ionized using the elec- 
trospray source. Data were collected and deconvoluted using the Agilent Mass 
Hunter qualitative analysis software. Before LC/MS analysis, AAC was treated 
with lysyl endopeptidase (Wako) for 30 min at 1:100 w/w enzyme to antibody 
ratio, pH 8.0, and 37 °C to produce the Fab and the Fc portion for ease of analysis. 
The drug-to-antibody ratio (DAR) was calculated using the abundance of Fab and 
Fab+1 calculated by the MassHunter software. 

Flow cytometry to compare expression of anti-MRSA antibodies. Analysis of 
bacteria isolated from infected mice. Balb/c mice were infected with 1 x 107 c.f.u. 
of MRSA (USA300) by intravenous injection and kidneys were harvested on day 
3 after infection. Kidneys were homogenized using a Gentle MACS dissociator 
in 5 ml volume per two kidneys using M-Tubes and the program RNAO1.01 
(Miltenyi Biotec). Homogenization buffer was: PBS plus 0.1% Triton-X-100, 10 ug 
ml! DNAase (bovine pancreas grade II, Roche) and protease inhibitors (com- 
plete protease inhibitor cocktail, Roche 11-836-153001). After homogenization, 
the samples were incubated at room temperature for 10 min and then diluted with 
ice-cold PBS and filtered through a 40 tm cell strainer. Tissue homogenates were 
washed twice in ice-cold PBS and then suspended in a volume of 0.5 ml per two 
kidneys in HB buffer (Hanks balanced salt solution supplemented with 10 mM 
HEPES and 0.1% BSA). The cell suspension was filtered again and 25 ul of the 
bacterial suspension was taken for each staining reaction (Fig. 2c). 

Antibody staining for flow cytometry. Bacteria (1 x 10’ of in vitro grown bacteria 
(Fig. 2d), or 25 ul of tissue homogenate described earlier (Fig. 2c) were sus- 
pended in HB buffer and blocked by incubation with 400 ug ml“! of mouse IgG 
(Sigma, 15381) for 1h. Fluorescently labelled antibodies were added directly 
to the blocking reaction and incubated at room temperature for an additional 
10-20 min. Bacteria were washed three times in HB buffer and then fixed in 
PBS 2% paraformaldehyde before FACS analysis. Test antibodies (anti-B-WTA, 
anti-a-WTA or isotype control-anti CMV-gD) were conjugated with Alexa-488 
using amine reactive reagents (Invitrogen, succinimidyl-ester of Alexa Fluor 
488, NHS-A488). Antibodies in 50 mM sodium phosphate were reacted with a 
5-10-fold molar excess of NHS-A488 in the dark for 2-3 h at room temperature. 
The labelling mixture was applied to a GE Sepharose $200 column equilibrated 
in PBS to remove excess reactants from the conjugated antibody. The number of 
A488 molecules per antibody was determined using the ultraviolet method as 
described by the manufacturer. 

For analysis of bacteria in tissue homogenates a non-competing anti-S. aureus 
antibody (rF1 (ref. 38)) was conjugated to Alexa-647 to distinguish S. aureus from 
similar sized particles. Test antibodies were examined at a range of doses from 
80ng ml! to 50 ug ml !. Flow cytometry was performed using a Beckton Dickson 
FACS ARIA (BD Biosciences) and analysis was performed using FlowJo analysis 
software (Flow Jo LLC). 

WTA-antibody complex purification and crystallization. The anti-B-WTA 
antibody Fab fragment was expressed in Escherichia coli and purified on Protein 
G Sepharose followed by SP sepharose cation exchange and size-exclusion chro- 
matography. Antibody was concentrated to 30 mg ml! in MES buffer (20 mM 
MES pH 5.5, 150mM NaCl) and mixed with a 2:1 mol/mol ratio of the WTA 
analogue (diluted in water) for crystallization trials. Sparse matrix crystallization 
screening provided initial hits in PEG-8000 based conditions, which were further 
optimized to provide diffraction quality crystals. Ultimately, data were collected on 
a crystal grown by the vapour diffusion method in a sitting drop containing 0.5 ul 
protein and 0.5 11 0.08 M sodium cacodylate pH 6.5, 0.16 M calcium acetate, 14.4% 
PEG-8000, and 20% glycerol. Crystals were cryo-protected in mother liquor, flash 
frozen in liquid nitrogen, and stored for data collection at 100K. 

Data collection and structure determination. Data were collected to 1.7 A at 
beamline 22ID at the Advanced Photon Source (APS) under cryo-cooled con- 
ditions (100 K) at a wavelength of 1.0 A. Data were reduced using HKL2000 
and SCALEPACK in the space group P2;2)2;, with unit cell parameters of 
a= 63.7 A, b=111.4A, c= 158.4A (see Extended Data Table 1 for process- 
ing statistics). The structure was solved by sequential molecular replacement 
searches using Fab constant and variable regions (Protein Data Bank accession 
4177) as individual search models. Iterative rounds of manual model adjustment 
with COOT followed by simulated annealing, coordinate, and b-factor refine- 
ment with Phenix and BUSTER (Global Phasing) gave a final model with R/Riree 
values of 20.6% and 23.7% respectively. Ramachandran statistics calculated by 
MolProbity indicate that 97.2% of the model residues lie in favoured regions, with 
0.5% outliers. 


Generation of a synthetic WTA fragment. Synthesis of dibenzyl phosphorochlori- 
date. A mixture of NCS (3.5 g, 26.6 mmol) was suspended in toluene (80 ml). Then 
dibenzyl phosphonate (2.0 g, 7.6 mmol) was added. The mixture was stirred at 
room temperature overnight. The white solid was filtered off and the organic phase 
was evaporated to give dibenzyl phosphorochloridate (1; 2.1 g, 96%) as light yellow 
oil. 'H NMR (300 MHz, CDCl, 25°C) 6 7.36 (s, 10H), 5.20 (m, 4H). 

Synthesis of 4-O-(2-acetamido-3,4,6-tri-O-acetyl-2-deoxy-B -p-glucopyrano- 
syl)-1-O-acetyl-p-ribitol-5-dibenzylphosphate. A mixture of 2 (described in ref. 44) 
(500 mg, 0.95 mmol) dissolved in pyridine (12 ml) was cooled to —30 °C and 1 
(described ref. 44) (595 mg, 2.0 mmol) was added, stirring for 2h at —30 °C and 
warmed to room temperature for 4 h. The mixture was added to HO, and concen- 
trated in vacuo. The residue was purified by column chromatography (silica gel: 
200 to ~300 mesh; dichloromethane: methanol in a 30:1 as eluent) to give 4-O-(2- 
acetamido-3,4,6-tri-O-acetyl-2-deoxy-}-p-glucopyranosyl)-1-O-acetyl-p-ribitol- 
5-dibenzylphosphate (3; 190 mg, 24%) as light yellow solid. 'H NMR (300 MHz, 
Acetone-dg, 25°C) 6 7.29-7.23 (m, 10H), 7.08 (d, 1H), 5.08 (t, 1H), 4.99-4.78 
(m, 6H), 4.31-3.97 (m, 8H), 3.82-3.63 (m, 3H), 1.88 (s, 3H), 1.86 (s, 6H), 1.79 (s, 3H), 
1.69 (s, 3H). LC/MS (m/z) ES+ 784 [M+H]*. 

Synthesis of 4-O-(2-acetamido-3,4,6-tri-O-acetyl-2-deoxy-B-b-glucopyranosyl)- 
1-O-acetyl-p-ribitol-5-phosphate. A mixture of 3 (150 mg, 0.19 mmol) 
dissolved in MeOH (6 ml) was hydrogenated over 10% Pd/C (20 mg) for 2h at 
room temperature. Then the mixture was filtered, and the filtrate was evaporated 
to give 4-O-(2-acetamido-3,4,6-tri-O-acetyl-2-deoxy-B-p-glucopyranosyl)-1-O- 
acetyl-p-ribitol-5-phosphate (4; 100 mg) as light yellow oil. LC/MS (m/z) ES+ 
604 [M+H]*. 

Synthesis of 4-O-(2-acetamido-2-deoxy-B-p-glucopyranosyl)-p-ribitol-5-phosphate 
(5). A mixture of 4 (80 mg, 0.16 mmol) dissolved in MeOH (10 ml) was cooled 
to 5°C and K,CO; (30 mg, 0.21 mmol) was added and stirred at 5°C for 3h. 
The reaction was then quenched with 1 N HCl, and concentrated in vacuo. The 
crude product was purified by gel filtration (LH-20, MeOH) to give 4-O-(2- 
acetamido-2-deoxy-B-p-glucopyranosyl)-p-ribitol-5-phosphate (5; 13.3 mg, 23%) 
as a white solid 'H NMR (300 MHz, MeOH-dy, 25°C) 6 4.62 (d, 1H), 4.30-4.02 
(m, 3H), 3.92-3.32 (m, 9H), 2.03 (s, 3H). LC/MS (m/z) ES+ 436 [M-+H]*. 
Time of kill for free antibiotics on non-replicating bacteria. S. aureus (USA300) 
was taken from an overnight stationary phase culture, washed once in PBS and 
suspended at 1 x 10’ c.f.u. ml! in PBS with no antibiotic or with 1 x 10°°M 
antibiotic in a 10 ml volume in 50 ml polypropylene centrifuge tubes. The bacteria 
were incubated at 37°C overnight with shaking. At each time point, three 1 ml 
samples were removed from each culture and centrifuged to collect the bacteria. 
Bacteria were washed once with PBS to remove the antibiotic and the total number 
of surviving bacteria was determined by plating serial dilutions of the bacteria 
on agar plates. 

Killing of persister cells by free antibiotics. S. aureus (USA300) was taken from 
an overnight stationary phase culture, washed once in tryptic soy broth (TSB) and 
then adjusted to a final concentration of 1 x 10’c.f.u. ml“! in a total volume of 
10 ml of either TSB or TSB with ciprofloxicin (0.05 mM). Cultures were incubated 
with shaking at 37°C for 6h and then the second antibiotic, either rifampicin 
(1 pg ml) or the rifalogue (1 pg ml~') was added. At the indicated times, samples 
were removed from each culture, washed once with PBS to remove the antibiotic 
and re-suspended in PBS. The total number of surviving bacteria was determined 
by plating serial dilutions of the bacteria on agar plates. At the final time point the 
remainder of each culture was collected and plated. 

Cathepsin release assay for AAC. To quantify the amount of active antibiotic 
released from AACs after treatment with cathepsin B, AACs were diluted to 
200 ug ml~! in cathepsin buffer (20 mM sodium acetate, 1 mM EDTA, 5mM 
L-cysteine, pH 5). Cathepsin-B (from bovine spleen, Sigma C7800) was added at 
10g ml”! and the samples were incubated for 1h at 37 °C. As a control, AACs 
were incubated in buffer alone. The reaction was stopped by addition of 9 volumes 
of bacterial growth media, TSB pH 7.4. To estimate the total release of active 
antibiotic, serial dilutions of the reaction mixture were made in quadruplicate 
in TSB in 96-well plates and MRSA (USA300) was added to each well at a final 
density of 2 x 10° c.fu. ml”. The cultures were incubated overnight at 37 °C with 
shaking and bacterial growth was measured by reading absorbance at 630 nM 
using a plate reader. 

Synthesis of anti-8-WTA antibody FRET conjugate. We synthesized and conju- 
gated a maleimide FRET peptide to the anti-8- WTA THIOMAB antibody. We used 
a FRET pair of tetramethylrhodamine (TAMRA) and fluorescein. The maleimide 
FRET peptide was synthesized by standard Fmoc solid-phase chemistry using a 
PS3 peptide synthesizer (Protein Technologies; B.-C.L., M.D. and R.V., manu- 
script in preparation)”. Briefly, 0.1 mmol of Rink amide resin was used to generate 
C-terminal carboxamide. We used a Fmoc-Lys(Mtt)-OH at the N- and C-terminal 
residues in order to remove the Mtt group on the resin and carry out additional 
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side-chain chemistry to attach TAMRA and fluorescein. The sequence of Val- 
Cit-Leu was added between the FRET pair as a cathepsin-cleavable spacer. The 
crude maleimide FRET peptide or maleimidocarproyl-K(TAMRA)-G-V-Cit-L-K 
(fluorescein) cleaved off from the resin was subjected to further purification 
by reverse-phase HPLC with a Jupiter 5u C4 column (5m, 10mm x 250mm; 
Phenomenex). Our FRET probe allows monitoring not only of the intracelluar 
trafficking of the antibody conjugate, but also the processing of the linker in the 
phagolysosome. The intact antibody conjugate fluoresces only in red due to the 
fluorescence resonance energy transfer from the donor. However, upon the sub- 
strate cleavage of the FRET peptide in the phagolysosome, the green fluorescence 
from the donor is expected to appear. 

Video microscopy to detect cleavage of the linker inside macrophages. Murine 
peritoneal macrophages were plated on chamber slides (Ibidi, catalogue 80826) 
in complete media as described for the macrophage intracellular killing assay. 
USA300 was labelled with Cell Tracker Violet (Invitrogen C10094) at 100 ug ml? 
in PBS 0.1% BSA by incubation for 30 min at 37°C. The labelled bacteria were 
opsonized with the anti-8-WTA-FRET probe by incubation for 1h in HB buffer. 
Macrophages were washed once immediately before addition of the opsonized 
bacteria, and bacteria were added to cells at 1 x 107 bacteria per ml. For no- 
phagocytosis controls, the macrophages were pre-treated with 60 nM Latrunculin 
A (Calbiochem) for 30 min before and during phagocytosis. The slides were placed 
on the microscope immediately after addition of bacteria to the cells and movies 
were acquired with a Leica SP5 confocal microscope equipped with an environ- 
mental chamber with CO, and temperature controllers from Ludin. The images 
were captured every minute for a total time of 30 min using a Plan APO CS x 40, 
N.A: 1.25, oil immersion lens, and the 488 nm and 543 nm laser lines to excite 
Alexa-488 and TAMRA, respectively. Phase images were also recorded using the 
543 nm laser line. 

Quantification of released antibiotic inside macrophages. Primary murine peri- 
toneal macrophages or RAW 264.7 cells (purchased from ATCC) were infected 
in 24-well tissue culture dishes as described later for the intracellular killing assay 
with MRSA opsonized with AAC at 100g ml! in HB. The RAW 264.7 cells were 
used without further authentication or testing for mycoplasma contamination. 
After phagocytosis was complete, the cells were washed and 250 ul of complete 
media plus gentamycin was added to wells and the cells were incubated for the 
indicated time points. At each time point, the supernatant and cellular fractions 
were collected followed by acetonitrile (ACN) addition to 75% final concentra- 
tion and incubated for 30 min. Cell and supernatant extracts were lyophilized by 
evaporation under N2 (TurboVap; Biotage) and reconstituted in 100 pl of 50% 
ACN, filtered using a 0.45 glass fibre filter plate (Phenomenex) and analysed by 
LC/MS/MS as follows. 

The rifalogue was separated on an Acquity UPLC (Waters Corporation) 
under gradient elution using a Phenomenex Kinetex XB-C18 column (100A, 
50 x 2.1 mm internal diameter, 2.6 im particle size). The column was maintained 
at room temperature. The mobile phase was a mixture of 10 mM ammonium ace- 
tate in water containing 0.1% formic acid (A) and 90% acetonitrile (B) at a flow 
rate of 1 ml min”. The rifalogue was eluted with a gradient of 3-98% B over 1 min, 
followed by 0.8 min at 98% B, then 0.7 min of 3% B to re-equilibrate the column. 
The injection volume was 10 ul. 

The Triple Quad 6500 mass spectrometer (Ab Sciex) was operated in a pos- 
itive ion multiple reaction-monitoring (MRM) mode. The rifalogue precursor 
(Q1) ion monitored was 927.6 m/z and the product (Q3) ion monitored was 
895.2 m/z with collision energy at 27 eV and declustering potential at 191 V. The 
MS/MS setting parameters were as follows: ion spray voltage, 5,500 V; curtain 
gas, 40 psi; nebulizer gas (GS1), 35 psi, (GS2), 50 psi; temperature, 600°C; and 
dwell time, 150 ms. 

Linear calibration curves were obtained for 0.41-100 nM concentration range 
by spiking rifalogue into cell or supernatant fractions (lacking MRSA or AAC) that 
were treated similarly to samples. Concentrations of rifalogue were calculated with 
MultiQuant software (Ab Sciex). 

In vitro intracellular killing assay. Non-phagocytic cell types. MG63 (CRL-1427) 
and A549 (CCL185) cell lines were obtained from ATCC and maintained in RPMI 
1640 tissue culture media supplemented with 10 mM HEPES and 10% fetal calf 
serum (RPMI-10). HUVEC cells were obtained from Lonza and maintained in 
EGM endothelial cell complete media (Lonza). HBMEC cells (catalogue #1000) 
and ECM media (catalogue #1001) were obtained from ScienceCell Research Labs. 
The cells were used without further authentication or testing for mycoplasma 
contamination. 

Murine macrophages. Peritoneal macrophages were isolated from the peritoneum 
of 6-8-week-old Balb/c mice (Charles River Laboratories). To increase the yield 
of macrophages, mice were pre-treated by intraperitoneal injection with 1 ml of 
thioglycolate media (Becton Dickinson). The thioglycolate media was prepared at 
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a concentration of 4% in water, sterilized by autoclaving, and aged for 20 days to 
6 months before use. Peritoneal macrophages were harvested 4 days after treatment 
with thioglycolate by washing the peritoneal cavity with cold PBS. Macrophages 
were plated in DMEM supplemented with 10% fetal calf serum, and 10 mM 
HEPES, without antibiotics, at a density of 4 x 10° cells well~! in 24-well culture 
dishes. Macrophages were cultured overnight to permit adherence to the plate. 
Human M2 macrophages. CD14* monocytes were purified from normal human 
blood using a Monocyte Isolation Kit II (Miltenyi, catalogue 130-091-153) and 
plated at 1.5 x 10° cells cm~? on tissue culture dishes pre-coated with fetal calf 
serum (FCS) and cultured in RPMI 1640 media with 20% FCS plus 100ng ml“! 
rhM-CSF. Media was refreshed on day 1 and on day 7, the media was changed to 
5% serum plus 20 ng ml! IL-4. Macrophages were used 18 h later. 

Assay protocol. In all experiments bacteria were cultured in TSB. To assess intracel- 
lular killing with AACs, USA300 was taken from an exponentially growing culture 
and washed in HB. AACs or antibodies were diluted in HB (Hanks balanced salt 
solution supplemented with 10 mM HEPES and 0.1% BSA) and incubated with 
the bacteria for 1h to permit antibody binding to the bacteria (opsonization), 
and the opsonized bacteria were used to infect macrophages at a ratio of 10-20 
bacteria per macrophage (4 x 10° bacteria in 250 ul of HB per well). Macrophages 
were pre-washed with serum-free DMEM media immediately before infection, 
and infected by incubation at 37 °C in a humidified tissue culture incubator with 
5% CO, to permit phagocytosis of the bacteria. After 2 h, the infection mix was 
removed and replaced with normal growth media (DMEM supplemented with 10% 
FCS, 10 mM HEPES) and gentamycin was added at 50 ug ml”! to prevent growth 
of extracellular bacteria*’. At the end of the incubation period, the macrophages 
were washed with serum-free media, and the cells were lysed in HB supplemented 
with 0.1% Triton-X (lyses the macrophages without damaging the intracellular 
bacteria). Serial dilutions of the lysate were made in PBS solution supplemented 
with 0.05% Tween-20 (to disrupt aggregates of bacteria) and the total number of 
surviving intracellular bacteria was determined by plating on tryptic soy agar with 
5% defibrinated sheep blood. 

ELISA for quantification of anti-MRSA antibodies in human serum. Cell wall 
preparations (CWPs) were generated from protein-A-deficient S. aureus by incu- 
bating 40 mg of pelleted bacteria per ml of 10mM Tris-HCl (pH 7.4) supplemented 
with 30% raffinose, 100 ug ml! of lysostaphin (Cell Sciences), and EDTA-free 
protease inhibitor cocktail (Roche), for 30 min at 37°C. The lysates were centri- 
fuged at 11,600g for 5 min, and the supernatants containing cell wall components 
were collected. 

ELISA experiments were performed using standard protocols. Briefly, plates 
were pre-coated with CWP and then incubated with human IgG preparations: 
purified human IGIV Immune Globulin (ASD Healthcare), pooled serum from 
healthy donors or from MRSA patients. The concentrations of anti-staphylococcal 
IgG present in the serum or purified IgG were calculated by using a calibration 
curve that was generated with known concentrations of anti-peptidoglycan mAb 
(4479) against peptidoglycan. 

Intravenous infection model for testing efficacy of the AAC. Seven-week- 
old female mice, Balb/c, were obtained from Jackson West, or SCID mice were 
obtained from Charles River Laboratories. Infections were carried out by intra- 
venous injection into the tail vein. SCID-hulgG model: CB17.SCID mice were 
reconstituted with IGIV Immune Globulin (ASD Healthcare) using a dosing 
regimen optimized to achieve constant serum levels of >10mg ml“! of human 
IgG. IGIV was administered with an initial intravenous dose of 30 mg per mouse 
followed by a second dose of 15 mg per mouse by intraperitoneal injection after 
6h, and subsequent daily dosings of 15 mg per mouse by intraperitoneal injection 
for 3 consecutive days. Mice were infected 4h after the first dose of IGIV with 
2 x 10’c.f.u. of MRSA diluted in PBS by intravenous injection. The wild-type 
USA300, protein-A-sufficient strain was used for all in vivo experiments. Mice 
that received vancomycin were treated with twice daily intraperitoneal injections 
of 110 mg kg”! of vancomycin starting between 6 and 24h after infection for the 
duration of the study. Experimental therapeutics (AAC, anti- MRSA antibodies 
or free rifalogue antibiotic) were diluted in PBS and administered with a single 
intravenous injection 30 min to 24h after infection. All mice were killed on 
day 4 after infection, and kidneys were harvested in 5 ml of PBS. The tissue 
samples were homogenized using a GentleMACS dissociator (Miltenyi Biotec). 
The total number of bacteria recovered per mouse (two kidneys) was determined 
by plating serial dilutions of the tissue homogenate in PBS 0.05% Tween on tryptic 
soy agar with 5% defibrinated sheep blood. 

Statistical analysis. All experiments were performed on biological replicates. 
Sample size for each experimental group per condition is reported in appropriate 
figure legends and Methods. For cell culture experiments, sample size was not 
predetermined, and all samples were included in the analysis. In animal experi- 
ments no statistical methods were used to predetermine sample size (n = number 
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of mice per group), and all animals were used for analysis unless the mice died 
or had to be euthanized when found moribund. These cases are annotated in the 
figures. The mice were not randomized after infection, and the investigators were 
not blinded to outcome assessment. When appropriate, statistically significant 
differences between control and experimental groups were determined using 
Mann-Whitney tests. 
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Extended Data Figure 1 | Intracellular MRSA can infect the brain even 
in the presence of vancomycin. In vivo infection of mice shown in 

Fig. la. Mice (n= 5) were infected with equivalent doses of free bacteria or 
intracellular bacteria and treated with vancomycin at 110 mg kg! 10 min 
after infection and then once per day. Bacterial burden was monitored in 
the brain 4 days after infection. Bars show geometric mean. 
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Extended Data Figure 2 | MRSA is able to grow in the presence of 
vancomycin when cultured on a monolayer of infectable cells. 

a, b, Similar to the set up in Fig. 1d, planktonic MRSA were either seeded 
in media alone, or in the presence of vancomycin. Intracellular bacteria 
were generated by infecting a monolayer of either A549 bronchial 


epithelial cells (a) or HBMECs (b) in the presence of vancomycin (vanco). 


In these experiments plates were centrifuged to promote contact of the 
bacteria with the monolayer to enhance intracellular infection. At each 
time point, the culture supernatant was collected to recover extracellular 
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bacteria and adherent cells were lysed to release intracellular bacteria. 
Extracellular bacteria (planktonic bacteria) grew well in media alone, 

but were killed by vancomycin. In wells containing a monolayer of 
mammalian cells (intracellular MRSA + vanco) a fraction of the bacteria 
were protected from vancomycin during the first 5h after infection and 
were able to expand within the intracellular compartment over 24h. Error 
bars show s.d. from triplicate wells. Representative of three independent 
experiments. 
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Extended Data Figure 3 | Rifalogue can also kill intracellular bacteria. 
a, Determining the intracellular MIC for rifalogue and rifampicin. MRSA 
was allowed to infect peritoneal macrophages and macrophages were 
cultured overnight in gentamycin to kill extracellular bacteria. Various 
doses of rifalogue (red) or rifampicin (grey) were added to the culture 
medium 1 day after infection and the number of viable intracellular 
bacteria was determined 24h later by spotting macrophage lysates 

onto agar plates. Data shown are representative of more than three 
independent experiments. b, Diffusion of rifalogue versus rifampicin into 
murine macrophages. Murine peritoneal macrophages were incubated 
with rifalogue or rifampicin in the culture media. Wells were harvested 

at 10 and 60 min and the total amount of antibiotic associated with 

the cells was determined by quantitative mass spectrometry. Results 

are shown as percentage of input. Error bars show s.d. from triplicate 
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wells. Representative of two experiments. Rifalogue is more lipophilic 
than rifampicin as a result of its two additional fused aromatic rings, 
determined by measuring logDs at pH 7, with rifalogue at 3.4, being 
100-fold higher than rifampicin (logD 1.3). Additionally, rifalogue has 

a more basic amine (pK, 9.7) compared to that found in rifampicin 

(pK, 8.2). This balance of lipophilicity and basicity in rifalogue allows 

it to localize preferentially in lysosomes. It is challenging to develop 
antibiotics with these properties for systemic administration due to 
poor pharmacokinetic (PK) properties and toxicity profiles associated 
with indiscriminate accumulation of these molecules in all host cells. 
However, appending such antibiotics to an anti-MRSA specific antibody 
both extends its half-life in the circulation and converts it into an inactive 
pro-drug whose properties are manifest only after it has been released in 
phagolysosomes of cells infected with MRSA. 
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Extended Data Figure 4 | AAC kills MRSA that survive treatment with 


vancomycin. In vivo infection of mice as shown in Fig. 1a. Mice were 


infected with equivalent doses of free bacteria or intracellular bacteria and 


treated with either saline (PBS) or vancomycin (vanco) at 110mg kg™', 
10 min after infection and then once per day. Selected mice were given 
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vancomycin as described earlier and also treated with a single dose of AAC 
at 50 mg kg"! 10 min after infection. Four days after infection, bacterial 
burden was monitored in the brain. Each point represents data 

from a single mouse (n= 5). Bars show geometric mean. *P < 0.05, 
Mann-Whitney U-test. 
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Extended Data Figure 5 | Human serum contains high levels of anti- 
S. aureus antibodies that can compete with the AAC for binding. 

a, To estimate the concentration of antibodies that could potentially 
compete for binding with the anti-B-WTA antibody used in the AAC, 
human IgG from various sources (normal human serum, serum from 
MRSA infected patients, purified human IgG (Sigma) or IGIV derived 
from pooled normal donors) was tested for binding to various bacterial 
cell wall preparations (CWPs) by enzyme-linked immunosorbent assay 
(ELISA). CWPs were made from either USA300 (wild type (WT)) or 
AtarMAtarS (DKO) USA300; the latter strain is deficient in the WTA- 
GlcNAc antigen recognized by the anti- WTA antibodies. For these 
studies protein-A-deficient USA300 background strains were used to 
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minimize non-specific antibody binding. A standard curve was generated 
by titrating known amounts of an anti-MRSA antibody directed against 
peptidoglycan on both cell wall extracts. b, Estimated concentration of 
anti-S. aureus antibodies in human serum. The amount of anti- MRSA 
antibodies in each sample was estimated by comparing the signal obtained 
for each sample with the standard curve. In the absence of WTA GlcNAc 
antigens, ~60—70% less serum IgG binding was observed (DKO ELISA; 
red bars). This indicates the high prevalence of natural antibodies against 
WTA in adult human serum. Results are reported as .g ml“! of antibody 
per 10mg ml? of total IgG. Error bars show mean +s.d. from triplicate 
wells. Data are representative of two independent experiments. 
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Extended Data Figure 6 | Optimization of the in vivo model of 
bacteraemia. a, Titration of AAC in intravenous infection model shown 
in Fig. 4c (n= 8 mice per group). b, Efficacy of AACs specific for B- WTA 
or a-WTA in SCID-IGIV model. SCID mice (n = 8 mice per group) 

were reconstituted with IGIV Immune Globulin using a dosing regimen 
optimized to achieve constant serum levels of >10 mg ml“! of human 


anti-o.-WTA 
AAC 


PBS anti-B-WTA 
AAC 


IgG and infected with MRSA. Mice were treated with 60 mg kg“! of the 
indicated AACs in a single intravenous injection 1 day after infection and 
bacterial burden was monitored in kidneys 4 days after infection. Each 
point represents data from a single animal. Bars show geometric mean. 
Mann-Whitney test: *** P< 0.005, not significant (NS) P > 0.05. 
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AAC release equivalent amounts of free antibiotic after treatment with extracellular bacteria and samples of cell lysates or supernatants were 
exogenous cathepsin-B. Released antibiotics from AACs made with collected in triplicate at indicated time points and the total amount of 
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USA300 grown in broth culture. Error bars show s.d. from triplicate wells. Error bars represent means + s.d. from triplicate wells. a~c, Representative 
b, Intracellular killing assay in primary mouse macrophages as shown in of two or more independent experiments. d, Rifampicin—AAC is not 
Fig. 3c indicates that the rifalogue-AAC, but not rifampicin—AAC is able efficacious in the SCID-IGIV intravenous infection model as shown in 
to kill intracellular S. aureus. Error bars show s.d. from triplicate wells. Fig. 4d. Each point represents data from a single mouse (m = 8 mice per 
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Extended Data Table 1 | Data collection and refinement statistics for anti-8-WTA-WTA complex 


Data collection 
Space group 
Cell dimensions 

a, b,c (A) 

a, B,y (°) 
Resolution (A) 
Rom or Reverse 
T/ol 
Completeness (%) 
Redundancy 


Refinement 
Resolution (A) 
No. reflections (total/test) 
Ror / Rérce 
No. atoms 
Protein 
WTA 
Glycerol 
Calcium 


Glycerol 
Calcium 
Water 

R.m.s. deviations 
Bond lengths (A) 
Bond angles (°) 


anti-B WTA Fab-— 
WTA complex 

APS SER-CAT 22ID 
P2,2;2, 


63.71, 111.47, 158.41 
90, 90, 90 

50-1.72 (1.78-1.72) 
0.056 (0.839) 

29.4 (2.2) 

99.9 (99.9) 

6.0 (5.9) 


33.64-1.72 
120147/6101 
20.6/23.7% 


6695 
56 
12 


35.9 
56.4 
49.4 
38.7 
43.3 


0.010 
1.13 


«Values in parentheses are for highest-resolution shell. 
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Ever since Stephen Paget’s 1889 hypothesis, metastatic organotropism has remained one of cancer’s greatest mysteries. Here we 
demonstrate that exosomes from mouse and human lung-, liver- and brain-tropic tumour cells fuse preferentially with resident cells 
at their predicted destination, namely lung fibroblasts and epithelial cells, liver Kupffer cells and brain endothelial cells. We show that 
tumour-derived exosomes uptaken by organ-specific cells prepare the pre-metastatic niche. Treatment with exosomes from lung-tropic 
models redirected the metastasis of bone-tropic tumour cells. Exosome proteomics revealed distinct integrin expression patterns, 
in which the exosomal integrins a,6,, and a,}, were associated with lung metastasis, while exosomal integrin a,6,, was linked to liver 
metastasis. Targeting the integrins a,6, and a,6, decreased exosome uptake, as well as lung and liver metastasis, respectively. We 
demonstrate that exosome integrin uptake by resident cells activates Src phosphorylation and pro-inflammatory S100 gene expression. 


Finally, our clinical data indicate that exosomal integrins could be 


Despite Stephen Paget’s 126-year-old “seed-and-soil” hypothesis’, 
insufficient progress has been made towards decoding the mecha- 
nisms governing organ-specific metastasis. In experimental metas- 
tasis assays, Fidler et al. demonstrated that cancer cells derived from 
a certain metastatic site displayed enhanced abilities to metastasize 
to that specific organ, providing support for Paget’s organ-specific 
metastasis theory”. Subsequent studies investigating organ-specific 
metastasis focused largely on the role of intrinsic cancer cell proper- 
ties, such as genes and pathways regulating colonization, in directing 
organotropism® *. Breast cancer cells express chemokine receptors, 
such as C-X-C motif receptor 4 (CXCR4) and C-C motif receptor 7 
(CCR7), which partner with chemokine ligands expressed in lymph 
nodes (CXCL12) and lung (CCL21), thus guiding metastasis**. 


used to predict organ-specific metastasis. 


Tumour-secreted factors can also increase metastasis by inducing 
vascular leakiness’, promoting the recruitment of pro-angiogenic 
immune cells®, and influencing organotropism’. Furthermore, the 
ability of breast cancer to form osteolytic lesions depends on osteoclast- 
stimulating growth factors (for example, PTHRP and GM-CSF) 
released into the bone microenvironment*®. Therefore, our previ- 
ous observation that metastatic melanoma-derived factors dictate 
organotropism is not surprising”. We found that medium conditioned 
by highly metastatic murine B16-F10 melanoma cells was sufficient to 
expand the metastatic repertoire of Lewis lung carcinoma cells that would 
typically metastasize to the lung’. We also showed that pre-metastatic 
niche formation requires $100 protein and fibronectin upregulation 
by lung resident cells, and the recruitment of bone-marrow-derived 


1Children’s Cancer and Blood Foundation Laboratories, Departments of Pediatrics, an 


Cornell Medicine, New York, New York 10021, USA. “Department of Plant Pathology and Microbiology and Center for Biotechnology, Nationa 
azar Biomedical Sciences Institute, University of Porto, 4099-003 Porto, Portugal. ‘Department of Obstetrics and Gynecology, 
13-8655, Japan. 5Proteomics Resource Center, The Rockefeller University, New York, New York 10065, USA. Department of Pathology, Memorial 
logy and Pathology, Karolinska Institutet, 17176 Stockholm, Sweden. *Electron Microscopy Resource 
USA. °Breast Service, Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, New York, 10065, USA. 


3Graduate Program in Areas of Basic and 
Faculty of Medicine, University of Tokyo, Tokyo 
Sloan Kettering Cancer Center, New York, New York 10065, USA. 7Department of Onco! 
Center (EMRC), Rockefeller University, New York, New York 10065, 
10Department of Surgery, County Council of Os’ 
Life Sciences Division, Lawrence Berke 
13Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, New Yor 
10065, USA. Division of Pediatric Oncology, Alberta Children’s H 
New York, New York 10032, USA. !’Orthopaedic 
Surgery, Oslo University Hospital, Nydalen, Oslo 
?0F bpley Institute for Research in Cancer and A\ 
Universi 
10065, USA. *°Department of Tumor Biology, Norwegian Radium 


Applied Biology, Abel Sa 


Center H 


2Department of Molecular Biology, Princeton University, Princeton, New Jersey 08544, USA. °°Ru 


Service, 


Cell and Developmental Biology, Drukier Institute for Children’s 


ergétland, and Department of Clinical and Experimental Medicine, Faculty of Health Sciences, Linképing 
ey National Laboratory, Berkeley, California 94720, USA. ‘*Genomics Resources Core Facility, Weill Cornell Me 

10065, USA. MDepartment of Pediatrics, Memorial S| 
ospital, Calgary, Alberta T3B 6A8, Canada. ‘Division of Hematology/Oncology, Columbia U 
Service, Department of Surgery, Memorial Sloan Ket 
0424, Norway. !°Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospita 
lied Diseases, University of Nebraska Medical Center, Omaha, Nebraska 68198, USA. *1Department fe) 
‘y of Nebraska Medical Center, Omaha, Nebraska 68198, USA. 2Gastric and Mixed Tumor 
Hospital, Oslo University Hospita 
of Oslo, Blindern, Oslo 0318, Norway. “°Department of Gynecology, University Medical Center, Mart 
amburg-Eppendorf, Martinistrasse 52, 20246 Hamburg, Germany. °’ Department of Radiation Oncology, Abramson Family Cancer Research 
Philadelphia, Pennsylvania 19104, USA. “8Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Phi 


Health, Meyer Cancer Center, Weill 
Taiwan University, Taipei 10617, Taiwan. 


University, 58185 Linképing, Sweden. 
icine, New York, New York 10021, USA. 
ering Cancer Center, New York, New York 
niversity School of Medicine, 

0065, USA. ‘Department of Hepato-Pancreato-Biliary 
, Nydalen, Oslo 0424, Norway. 
Biochemistry and Molecular Biology, 
Service, Department of Surgery, Memorial Sloan Kettering Cancer Center, New York, New York 
_ Nydalen, Oslo 0424, Norway. “Institute for Clinical Medicine, Faculty of Medicine, University 
inistrasse 52, 20246 Hamburg, Germany. “Department of Tumor Biology, University Medical 
nstitute, University of Pennsylvania, 
adelphia, Pennsylvania 19104, USA. 


joan Kett 


ering Cancer Center, New York, New York 


Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, New York 10065, USA. 32Fred Hutchinson Cancer Research Cen 
33Microenvironment and Metastasis Laboratory, Department of Molecular Oncology, Spanish National Cancer Research Center (CNIO), Madrid 28029, Spain. “*Department o 


, New Jersey 08903, USA. 31Breast Medicine 
er, Seattle, Washington 98109, USA. 
Medicine, Memorial 


gers Cancer Institute of New Jersey, New Brunswic 


Sloan Kettering Cancer Center, New York, New York 10065, USA. 35Department of Medicine, Weill Cornell Medicine, New York, New York 10021, USA. 


*These authors contributed equally to this work. 


19 NOVEMBER 2015 | VOL 527 | NATURE | 329 


© 2015 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


Exosomes injected (MDA-MB-231) 


231 parental 


4175-LuT 


1833-BoT 831-BrT 


aos — Cc, wee e 
S 0.4 3 
cies Zs 
§ 0.3 g 4 ne! 
© 0.2 G z 
+ + i 
° rae 9 2 = 
fn] 0.1 Fal 2 
ie} , 7 0 7 —— ; — =] 
Saree N 231 1833 4175 831 2 
GLP Se & ov -BoT -LuT -BrT 2 
Ww Ww @ x Ol Lung = 
d Bi Liver + 
b Lung @ Liver 155 Brain —_*__ 
20 = 


#% 


= 


231 1833 4175 831 
-BoT -LuT -BrT 


g BoT cells 
PBS 


Liver 


Exosome pps LuT BoT PBS LuT BoT 
education 


Tumour cells LuT = LuT 


Whole 
lung 
N 


BoT BoT 


LuT BoT 
(3 


5.0 x 108 IS 4.0 x 10® 
zs NS NS 
= 4.0x 108 a 3.0 x 108 oF: 
- 
2 3.0x 108 
° 2.0 x 10® 
B 2.0 x 108 g 
x 108 — 
2 1.0% 108 = ee faa] 
= 0 0 
PBS LUT BoT PBS LuT BoT 
exo exo exo exo 
LuT cells BoT cells 
Figure 1 | Cancer-cell-derived exosomes localize to and dictate future 


metastatic organs. a, Biodistribution of human cancer-cell-line-derived 
exosomes in the lung and liver of naive mice. Quantification of exosome- 
positive (Exo*) areas by NIR imaging of whole lung, in arbitrary units (a.u.) 
(n=3 per group). b, Immunofluorescence quantification of exosome-positive 
cells (n = 3, three independent experiments). c, MDA-MB-231- (parental), 
1833-BoT-, 4175-LuT- and 831-BrT-derived exosome biodistribution. 
Quantification of exosome-positive areas by NIR imaging of whole lung 

(n= 3 for all, except 831-BrT, in which n= 4). d, Immunofluorescence 
quantification of exosome-positive cells (1 =5 animals pooled from two 
independent experiments). e, Top, NIR whole-lung imaging of MDA-MB-231 
sublines. BoT, bone-tropic; BrT, brain-tropic; LuT, lung-tropic. Bottom, 


myeloid cells in response to tumour-secreted factors’. These events 
establish a favourable microenvironment that promotes the growth of 
disseminated tumour cells upon their arrival’ "’. 

Recently, we demonstrated that exosomes are one of the tumour- 
derived factors inducing vascular leakiness, inflammation and bone mar- 
row progenitor cell recruitment during pre-metastatic niche formation 
and metastasis'!. Exosomes are small membrane vesicles (30-100 nm) 
containing functional biomolecules (that is, proteins, lipids, RNA 
and DNA) that can be horizontally transferred to recipient cells'*”. 
We showed that an ‘exosomal protein signature’ could identify mel- 
anoma patients at risk for metastasis to nonspecific distant sites’. 
Moreover, in the context of pancreatic cancer exosomes, we defined 
the sequential steps involved in liver pre-metastatic niche induction”’. 

Taken together, these findings led us to investigate whether mole- 
cules present on tumour-derived exosomes are ‘addressing’ them to 
specific organs. To test this idea, we profiled the exosomal proteome 
of several tumour models (osteosarcoma, rhabdomyosarcoma, Wilms 
tumour, skin and uveal melanoma, breast, colorectal, pancreatic and 
gastric cancers), all of which have a propensity to metastasize to spe- 
cific sites (that is, brain, lung or liver). We subsequently analysed the 
biodistribution of tumour-secreted exosomes and found that exosomal 
integrins (ITGs) direct organ-specific colonization by fusing with tar- 
get cells in a tissue-specific fashion, thereby initiating pre-metastatic 
niche formation. Remarkably, we found that tumour-secreted exosomes 
are sufficient to redirect metastasis of tumour cells that normally lack 
the capacity to metastasize to a specific organ. Finally, our clinical 
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fluorescence microscopy of lung, liver and brain injected with MDA-MB-231 
subline-derived exosomes. Arrows indicate exosome foci. All NIR and 
immunofluorescence images are representative of five random fields. 

f, Redirection of metastasis by education with organotropic exosomes. 
4175-LuT or 1833-BoT cell metastasis in the lung after treatment with PBS, 
4175-LuT or 1833-BoT exosomes. Top, quantitative bioluminescence of 
metastatic lesions. Bottom, graphs show quantification of luciferase activity 
(n=5 for all, except for LuT exo/LuT cells, in which n = 4; data representative 
of two independent experiments). g, Lung haematoxylin/eosin staining 

for f. Arrows indicate lung metastasis. Scale bars, 5 mm (e, top, f), 50 um (e, 
bottom) and 500m (g). Data are mean + s.e.m. NS, not significant; * P< 0.05; 
**P < 0.01; ***P-< 0.001 by one-way analysis of variance (ANOVA). 


data indicate that integrin expression profiles of circulating plasma 
exosomes isolated from cancer patients could be used as prognostic 
factors to predict sites of future metastasis. Our findings pave the 
way for the development of diagnostic tests to predict organ-specific 
metastasis and therapies to halt metastatic spread. 


Future metastatic sites uptake exosomes 

To examine whether tumour exosomes colonize specific organ sites, 
we isolated exosomes from organotropic human breast and pancreatic 
cancer cell lines that metastasize primarily to the lung (MDA-MB-231), 
liver (BxPC-3 and HPAF-II), or both (MDA-MB-468). We then 
retro-orbitally injected 10 wg of near infrared (NIR) or red fluorescently 
labelled exosomes into nude mice and, 24h after injection, quanti- 
fied exosome biodistribution and uptake in distant organs by NIR 
whole-lung imaging and confocal microscopy (Fig. la and Extended 
Data Fig. 1a). We observed a more than threefold increase in the uptake 
of MDA-MB-231 and/or 468- versus BxPC-3- and HPAF-II-derived 
exosomes in the lung (Fig. 1a, b). By contrast, liver uptake of BxPC-3 
and HPAF-II exosomes was four times more efficient than that of 
MDA-MB-231 exosomes (Fig. 1a, b). Moreover, mouse E0771 breast 
cancer exosomes were four-to-fivefold more efficiently uptaken in lung, 
whereas mouse Pan02 pancreatic cancer exosomes were four times 
more efficiently uptaken in liver (Extended Data Fig. 1b). Therefore, the 
organ specificity of exosome biodistribution matched the organotropic 
distribution of the cell line of origin in both immune-compromised and 
immune-competent models. 
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Figure 2 | Organ-specific tumour exosomes interact with resident 

cells. a, Heat map of integrin signals from quantitative mass spectrometry 
analysis, based on Z-scored label-free quantification (LFQ) values (technical 
triplicates). b, Western blot analysis of integrins from organotropic 
cell-line-derived exosomes, representative of three independent experiments. 
For western blot source data, see Supplementary Fig. la—h. c, Analysis by 
immunofluorescence of exosome distribution (red) and different resident 


These observations suggested that exosomes could promote 
organ-specific metastasis. We tested whether exosomes from the 
MDA-MB-231 sub-lines that colonize lung, bone or brain (4175-LuT, 
1833-BoT or 831-BrT, respectively)*!-*4 would also exhibit organ 
tropism. Although exosomes from the MDA-MB-231 variants were 
similar in size and morphology (Extended Data Fig. 1c), their bio- 
distribution varied 24h after injection: lung-tropic 4175-LuT exosomes 
preferentially localized to the lung with a more than fourfold increase in 
exosome-positive cells compared to 1833-BoT and 831-BrT exosomes 
(Fig. 1c-e and Extended Data Fig. 1d), whereas 831-BrT exosomes 
efficiently localized to the brain with a more than fourfold increase 
compared to 1833-BoT and 4175-LuT exosomes (Fig. 1c-e). Liver and 
bone showed no significant differences in lung-, brain- or bone-tropic 
MDA-MB-231-derived exosome distribution, with the exception of 
831-BrT exosomes that were uptaken less efficiently by bone mar- 
row cells than exosomes isolated from other MDA-MB-231 sub-lines 
(Fig. 1d, e and Extended Data Fig. le). Taken together, our data sug- 
gest that exosomes from different cancer models recapitulate the organ 
specificity of their cell of origin. 


Exosomes redirect metastatic distribution 

We proposed that, in addition to cell-intrinsic genetic determi- 
nants of organotropism™* , tumour exosomes could also facilitate 
organ-specific metastatic Behosiou: by preparing pre-metastatic niches. 

To gain insight into tumour exosome uptake at future metastatic 
sites, we intravenously injected 4175-LuT exosomes labelled with 
FM1-43 dye into naive animals, then used electron microscopy to 
distinguish endogenous from exogenous exosomes in lung sections. 
We detected tumour FM1-43-labelled exosomes in pre-metastatic 
cells (Extended Data Fig. 2a; red arrows, exogenous tumour-derived 
exosomes; black arrows, endogenous stromal exosomes). Moreover, 
NIR whole-mount lung imaging revealed that NIR-labelled 4175-LuT 
exosomes accumulated in the lungs of naive animals after three con- 
secutive daily exosome injections (Extended Data Fig. 2b). 

To condition or ‘educate’ cells in specific organs, we retro-orbitally 
injected 4175- ful or 1833-BoT exosomes into mice every other day 
for three weeks’. To test exosome education of target organs function- 
ally, luciferase-expressing 4175-LuT or 1833-BoT cells were injected 


High 


BxPC-3 Exo F4/80 (macrophage) 831 Exo CD31 (endothelial cell) 


Liver Brain 


cell types (green). Left to right: lung co-staining with 4175-LuT exosomes 
and S100A4 (fibroblasts) or SPC (epithelial cells), liver co-staining with 
F4/80 (macrophages) and BxPC-3-LiT exosomes, and brain co-staining 

with CD31 (endothelial cells) and 831-BrT exosomes. Scale bar, 30 um. 
Immunofluorescence images are representative of five exosome-positive cells 
each, from n=5 mice. 


into exosome-educated mice via the tail vein (Fig. 1f, g and Extended 
Data Fig. 2c) or intracardially (Extended Data Fig. 2d). Lung-tropic 
4175 exosomes marginally increased the lung-metastatic capacity of 
4175-LuT tumours. Remarkably, education with 4175-LuT-derived 
exosomes, but not with 1833-BoT exosomes or PBS, yielded a signifi- 
cant (sevenfold with intravenous and ten-thousand-fold with intracar- 
diac injections) increase in lung metastatic capacity of 1833-BoT cells 
(Fig. 1f, g and Extended Data Fig. 2d). Conversely, 1833-BoT-derived 
exosomes did not affect 4175-LuT cell metastasis to the lung (Fig. 1f 
and Extended Data Fig. 2c). These data suggest that organotropic 
tumour exosomes prepare pre-metastatic niches potent enough to 
facilitate metastasis even for tumour cells poorly capable of colonizing 
these sites. 


Exosomal ITGs determine organotropism 
We then postulated that exosomal adhesion molecules could regulate 
local microenvironments within future metastatic organs. Quantitative 
mass spectrometry of brain-, lung- and liver-tropic metastatic exosomes 
identified six integrins among the top 40 most abundant adhesion 
molecules, making integrins the most highly represented protein family 
in this analysis. These data indicate a correlation between exosomal 
integrins and metastatic tropism (Extended Data Fig. 3a). 
Interestingly, we found that integrin expression profiles corre- 
lated with tissue organotropism. Both quantitative mass spectrom- 
etry (Fig. 2a) and western blot analysis (Fig. 2b and Extended Data 
Fig. 3b) revealed that integrin alpha 6 (ITGa,), and its partners ITGB, 
and ITGB, (ref. 25), were present abundantly in lung-tropic exosomes. 
By contrast, ITG£, which associates only with ITGa, (ref. 25), was 
detected primarily in liver-tropic exosomes (Fig. 2a, b). We confirmed 
these findings by exosome proteomics for 28 organ-specific metastatic 
cell lines (Extended Data Tables 1 and 2). Qualitative mass spectrometry 
revealed that ITGa, was present in lung-tropic exosomes, whereas 
ITG§, was found in liver-tropic exosomes (Extended Data Tables 1 
and 2), consistent with our quantitative proteomics data. Exosomes 
from 4173, 4175 and 4180 lung-tropic MDA-MB-231 variants 
expressed ITGa,8, (Extended Data Table 1). Meanwhile, ITGB, 
was present in exosomes isolated from brain-tropic cells (Extended 
Data Table 1). Notably, unlike non-cancerous lung fibroblast WI-38 
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Figure 3 | Exosomal ITG6, expression functionally contributes to 
4175-LuT exosome localization and mediates lung metastasis. a, Top, 
NIR whole-lung imaging of 4175-LuT- or 41758,,KD-derived exosomes, or 
4175-LuT-derived exosomes pre-incubated with RGD or HYD-1 peptides. 
Bottom, fluorescence microscopy. Arrowheads indicate exosome foci. 

b, Quantification of exosome-positive areas from the whole-lung images 
in a (top) (n= 4, except 4175, in which n= 6). c, Immunofluorescence 
quantification of exosome-positive cells from a (bottom) (n = 6 pooled 
from two independent experiments). d, Left, NIR whole-lung imaging 

of 1833-BoT (n= 5) or 1833-BoT overexpressing ITGB, (18336 ,OE) 

(n= 4) exosomes. Right, quantification of the exosome-positive areas. e, 
Experimental lung metastasis of 41758 ,KD cells after education with wild- 


or epithelial MCF10A exosomes, metastatic cell exosomes contained 
ITGa,f,, suggesting that this integrin could serve as a biomarker for 
metastasis (Extended Data Table 1). Importantly, exosomal integrin 
expression does not necessarily reflect cellular integrin expression, con- 
sistent with selective packaging of integrins in exosomes (Extended 
Data Fig. 3c). Taken together, our data suggest that exosomal integrin 
expression patterns underlie organotropism to the lung, liver and brain. 


Distinct cells uptake tropic exosomes 

To identify the cells uptaking tumour exosomes in each organ, we 
intravenously injected red fluorescently labelled exosomes from 
4175-LuT, 1833-BoT, BxPC-3-liver-tropic (BxPC-3-LiT) or 831-BrT 
cells into mice (Fig. 2c and Extended Data Fig. 4a, b). Both 1833-BoT 
and 4175-LuT exosomes promoted vascular leakiness 24 h after 
injection, before exosome uptake by specific lung cells (Extended 
Data Fig. 4b). These observations fit with our previous studies 
using melanoma exosomes', suggesting that exosomes first per- 
meabilize vessels, allowing for exosome diffusion before uptake 
by parenchymal cells. Unexpectedly, we found that the specific 
cell type responsible for exosome uptake varied depending on the 
metastatic organ. Lung-tropic 4175 exosomes mainly co-localized 
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type or 41756,KD exosomes. Bioluminescence imaging of lung metastasis 
and quantification of luciferase activity (n= 6, data representative of 

two independent experiments). f, Heat map of S100 gene expression fold 
change by quantitative reverse transcription PCR (qRT-PCR) in 4175-LuT 
or 41758,KD exosome-conditioned lung fibroblast (WI-38) or epithelial 
(HBEpC) cells, and liver-tropic BxPC-3 or BxPC-38;KD exosome- 
conditioned Kupffer cells. Red represents high and green represents low 
expression (1 = 3 in two independent experiments). Scale bars, 5 mm (a, top, 
d, e) and 50 um (b, bottom). Data are mean + s.e.m. * P< 0.05; **P< 0.01; 
*** P< 0.001 by one-way ANOVA (b, ¢, e); ***P < 0.001 by two-tailed 
Student’s t-test (d). 


with S$100A4-positive fibroblasts and surfactant protein C (SPC)- 
positive epithelial cells (40% and 30% of exosome-positive cells, 
respectively) in the lung (Fig. 2c and Extended Data Figs 4a and 
5a, top). By contrast, pancreatic cancer exosomes derived from 
BxPC-3-LiT cells fused with Kupffer cells”° (90% of exosome- 
positive cells; Fig. 2c and Extended Data Figs 4a and 5a, middle). 
Finally, 831-BrT exosomes interacted mainly with CD31-positive brain 
endothelial cells (98% of exosome-positive cells; Fig. 2c and Extended 
Data Figs 4a and 5a, bottom). Collectively, these data demonstrate 
that specific tissue-resident stromal cells differentially uptake tumour 
exosomes in metastatic target organs. 

We proposed that the unique exosomal integrins may interact 
with cell-associated extracellular matrix (ECM), mediating exosome 
uptake in specific target organs. We found that 4175-LuT exosomes 
expressing ITGa,B, and ITGa,$, co-localized with S100A4-positive 
cells in laminin-rich lung microenvironments (Extended Data Fig. 5b, 
top). Meanwhile, ITGa,$,-expressing pancreatic BxPC-3-LiT 
exosomes co-localized with F4/80* macrophages in fibronectin-rich 
liver microenvironments (Extended Data Fig. 5b, bottom). Therefore, 
specific exosomal integrins may selectively adhere to ECM-enriched 
cellular areas in the lung and liver. 
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Figure 4 | Exosomal integrin expression as a potential predictor of 
patient organ-specific metastasis. a, Exosomal ITGA, levels in breast 
cancer patients who were metastasis-free at the time of blood draw. Amount 
of ITG£, per microgram of exosome in healthy control (Ctrl) subjects 

(n= 6); patients with ductal carcinoma in situ (DCIS) (n= 7), invasive 
breast cancer without relapse within three years (NED, no evidence of 
disease) (1 = 8), locoregional recurrence (LR) within three years (n= 2), 
bone metastasis within three years (n = 3), or lung metastasis within three 
years (n = 2). POD, progression of disease. b, Exosomal ITGa, in pancreatic 


Exosomal tropism requires ITGB, and ITG£,; 

We next asked whether manipulating the integrin cargo packaged 
into exosomes could impact metastatic organotropism. To test 
the requirement for exosomal ITG, in lung tropism, we knocked 
down ITG6, expression in 4175-LuT cells using short hairpin RNAs 
(shRNAs) (41758 ,KD; Extended Data Fig. 6a). We found a more than 
threefold reduction in labelled ITGB,KD exosomes in the lung com- 
pared with labelled control exosomes 24h after injection (Fig. 3a-c). 
To test the requirement for exosomal ITGB, binding to laminin for 
lung tropism, we blocked integrin binding using RGD and HYD-1 
peptides. Pre-incubation of 4175-LuT exosomes with HYD-1, which 
blocks laminin receptors”®, markedly reduced exosome uptake in 
the lung, whereas RGD, which blocks fibronectin receptors but not 
ITGB, (ref. 27), did not significantly alter exosome uptake in the 
lung (Fig. 3a—c). Pre-treatment of 4175-LuT exosomes with HYD-1 
peptide also prevented their uptake by WI-38 lung fibroblasts 
in vitro (Extended Data Fig. 6b). Conversely, ITGB , overexpression 
in 1833-BoT exosomes was sufficient to increase exosome uptake 
in lung (Extended Data Fig. 6c and Fig. 3d). These data demon- 
strate that integrins are responsible for organ-specific uptake of 
exosomes, and that ITGB, promotes tumour exosome adhesion 
within the lung. 

We next investigated whether ITGB,KD exosomes could modu- 
late the metastatic organotropism of 4175-LuT models. Knockdown 
of ITGB, was sufficient to reduce the lung metastatic capacity of 
4175-LuT cells (Fig. 3e). Education with 4175-LuT exosomes, but 
not ITGB,KD exosomes, rescued the metastatic ability of ITGB,KD 
cells, yielding metastasis similar to 4175-LuT cells (Fig. 3e and 
Extended Data Fig. 6d). Therefore, ITGB ,-expressing exosomes 
can confer lung-metastatic behaviour to cells with limited capacity 
to colonize the lung. Similarly, ITGB,, knockdown in BxPC-3-LiT 
exosomes decreased liver uptake by sevenfold compared with con- 
trol BxPC-3-LiT exosomes (Extended Data Fig. 6e, f). Moreover, 
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cancer patients who were metastasis-free at the time of blood draw. Amount 
of ITGa, per microgram of exosome in healthy control subjects (n = 13); 
patients with pancreatic cancer without relapse within three years (n= 14), 
or liver metastasis within three years (n = 13). c, Model of exosome- 
mediated organotropic tumour dissemination. Tumour-derived exosomes 
are uptaken by organ-specific resident cells in future metastatic organs based 
on integrin expression. Data are mean + s.e.m. *P < 0.05; **P< 0.01 by 
one-way ANOVA. 


pre-incubation with RGD peptide or anti-ITGa,8, antibody, but not 
HYD-1 peptide, significantly reduced BxPC-3-LiT and Pan02-LiT 
exosome adhesion to the liver (Extended Data Figs 6g and 7a, respec- 
tively). Importantly, RGD peptides also inhibited the education effect 
of Pan02-LiT exosomes, subsequently blocking pre-metastatic niche 
formation and liver metastasis (Extended Data Fig. 7b). Our data sup- 
port the hypothesis that local microenvironmental changes induced by 
specific exosomal cargo (that is, ITGB, or ITGB,) can dictate metastatic 
organotropism. 


Exosomal ITGs activate S100 genes 

To identify the downstream effects of exosomal interaction with 
target cells, Kupffer cells were educated with either BxPC-3-LiT or 
BxPC-3-LiT ITGB,KD exosomes every other day for two weeks. 
Unbiased analysis of gene expression by RNA sequencing in Kupffer 
cells identified 906 genes upregulated more than twofold after treat- 
ment with BxPC-3-LiT exosomes compared to BxPC-3-LiT ITGB,KD 
exosomes. Cell migration genes were the most prominently upregu- 
lated (twofold for 221 genes; fourfold for 42 genes). Of these, S100A8 
and S100P were upregulated more than fourfold (Fig. 3f GEO acces- 
sion GSE68919). Since pro-inflammatory S100 gene expression 
correlates with metastasis”*°, we analysed $100 genes in tumour 
exosome-educated lung WI-38 fibroblasts and in human bronchial 
epithelial cells (HBEpCs). Several S100 genes (S100A4, -A6, -A10, 
-A11, -A13 and -A16) were upregulated more than fivefold after 
WI-38 fibroblast treatment with 4175-LuT exosomes compared with 
4175-LuT ITGB,KD exosomes (Fig. 3f). Notably, $100 genes remained 
unchanged in HBEpCs treated with 4175-LuT exosomes (Fig. 3f). 
Moreover, exosome-treated lung fibroblasts proliferated and migrated 
more than controls (Extended Data Fig. 7c, d), which correlated with 
a higher frequency of $100A4' cells in the lungs after three weeks of 
education with 4175-LuT, but not 41758 ,KD, exosomes (Extended 
Data Fig. 7e). We then surveyed, by in-cell western blot analysis, ITGB , 
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signalling proteins in WI-38 fibroblasts treated with 4175-LuT or 
4175B,KD exosomes. Notably, only Src or phosphorylated Src (pSrc) 
levels increased in an exosomal ITGB ,-dependent manner (Extended 
Data Fig. 7f), consistent with the known roles of ITGa,B, in Src activa- 
tion and $100A4 expression’’. Therefore, in addition to their adhesive 
properties, exosomal integrins can activate Src and upregulate pro- 
migratory and pro-inflammatory $100 molecules in specific resident 
cells within distant tissue microenvironments, influencing the expres- 
sion of genes implicated in facilitating tumour metastasis. 


Exosomal ITGs as organotropism biomarkers 

Next we investigated whether exosomal integrin content could predict 
tumour progression. ITGB, levels were increased in the plasma of mice 
six weeks after orthotopic 4175-LuT cell injection into the mammary 
fat pad, but were significantly reduced after successful tumour resection 
(Extended Data Fig. 8a). Furthermore, we performed ELISA assays 
for plasma-derived exosomal integrins in patients with lung (ITGB,) 
or liver (ITGay, the binding partner of ITGB;) metastasis. We found 
increased ITG, levels in exosomes from patients with lung metastasis 
(regardless of tumour type) compared with patients with no metastasis 
or liver metastasis (Extended Data Fig. 8b). Exosomes isolated before 
metastasis from patients with breast cancer that progressed to develop 
lung metastasis (POD) expressed the highest levels of exosomal ITGB, 
(Fig. 4a). ITGa, was significantly increased in exosomes isolated from 
cancer patients with liver metastasis compared with patients with no 
metastasis or lung metastasis (Extended Data Fig. 8c). Finally, exosomal 
ITGa, levels at diagnosis were higher in patients with pancreatic cancer 
who developed liver metastasis than in those without liver metastasis 
within three years of diagnosis or in control subjects (Fig. 4b). Taken 
together, our data indicate that the specific exosomal integrins in breast 
and pancreatic cancer patient plasma correlate with and predict likely 
sites of metastasis. 


Discussion 

Since Stephen Paget’s hypothesis first emerged, many studies have 
focused on identifying cell-intrinsic determinants of organ-specific 
metastasis*”?>.2435, We now show that tumour-derived exosomes 
prepare a favourable microenvironment at future metastatic sites 
and mediate non-random patterns of metastasis. We identify 
determinants of exosome-mediated organ-specific conditioning 
that allow the redirection of metastasis. Previously, adhesion and 
ECM molecules, such as integrins, tenascin and periostin, were 
shown to promote metastasis of disseminating cancer cells**”. 
We define a specific repertoire of integrins expressed on tumour- 
derived exosomes, distinct from tumour cells, which dictates 
exosome adhesion to specific cell types and ECM molecules in par- 
ticular organs. Notably, exosomes expressing ITGa,f, specifically 
bind to Kupffer cells, mediating liver tropism, whereas exosomal 
ITGa,$, and ITGa,6, bind lung-resident fibroblasts and epithelial 
cells, governing lung tropism (Fig. 4c). 

Interestingly, bone-tropic exosomes expressed a limited integrin 
repertoire, but were capable of inducing vascular leakiness in the 
lung despite lack of uptake in the lung parenchyma. These results 
suggest that whereas induction of vascular leakiness may be the first 
exosome-mediated step during the metastatic cascade, it is insuffi- 
cient to promote metastasis. Thus, integrin-independent mechanisms 
may mediate vascular leakiness and exosome involvement in bone 
metastasis. 

Cell-type-specific exosome integrin uptake promoted pro-migratory 
and pro-inflammatory S100 gene upregulation (S100A4, -A6, -A10, 
-A11, -A13 and -A16 in lung fibroblasts; S100P and -A8 in Kupffer 
cells). Notably, tumour exosomes failed to elicit $100 upregulation in 
lung epithelial cells, highlighting the cell-type specificity of exosomal 
education. Since $100A4 regulates lung metastasis”’ and is controlled 
by ITGa,B, (ref. 41), we conclude that exosomal ITGa,, activates 
the Src-S100A4 axis in lung fibroblasts during pre-metastatic niche 
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formation. Therefore, we propose that exosomal integrins not only 
promote adhesion, but also trigger signalling pathways and inflam- 
matory responses in target cells resulting in the education of that 
organ and rendering it permissive for the growth of metastatic cells. 

We provide the proof-of-principle that integrin-blocking decoy 
peptides successfully ablate tumour exosome adhesion in an integrin- 
specific and organ-specific manner. Thus, it is no longer surprising that 
targeting ITGa, in breast cancer cells prevented metastasis to other 
organs but not to the lung*” “*. However, strategies targeting exosomal 
integrins may effectively block organ-specific metastasis. Collectively, 
our data suggest that exosomal integrins and exosome-inducible 
S100 molecules in target cells represent candidates for anti-metastatic 
combination therapies. 

Overall, our findings suggest that circulating tumour- derived 
exosomes may be useful not only to predict metastatic propensity’, but 
also to determine organ sites of future metastasis. We believe exosomes 
perform distinct roles during each of the sequential steps (that is, 
vascular leakiness, stromal cell education at organotropic sites, 
bone-marrow-derived cell education and recruitment) necessary to 
complete pre-metastatic niche evolution''”*”. 

Future studies will focus on identifying exosomal integrins and pro- 
teins that could dictate metastasis to other organs, as well as further 
exploring the potential of exosomal ITGa,f, as a marker and driver of 
all cancer metastasis. Our findings demonstrate an important role for 
exosomes in dictating organ-specific metastasis, thus providing a basis 
for deciphering the mystery of organotropism. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Cell lines and cell culture. The cell lines used in this study were provided as follows: 
human breast cancer MDA-MB-231 organotropic lines 4175, 1833 and 831 by 
J. Massagué; human breast cancer 4173 and 4180 cells by A. Minn; human breast 
cancer 231BR cells by P. Steeg; liver metastasis enriched uveal melanoma cells by 
V. Rajasekhar; human osteosarcoma 143B cells by A. Narendran; human melanoma 
131/4-5B2 and 131/8-2L cells by R. Kerbel; human melanoma SB1B cells by C. F. 
Verschraegen; human rhabdomyosarcoma CT10 and RD cells by from R. Gladdy; 
and human Wilms tumour CCG9911 and CLS1 cells by A. Ketsis. Human breast 
cancer cell lines MDA-MB-231 and MDA-MB-468, human breast epithelial cells 
MCF10A, human pancreatic cancer cell lines, gastric cancer cell lines and colorectal 
cancer cell lines were purchased from American Type Culture Collection (ATCC). 
Although HT29 is commonly misidentified, we purchased this cell line directly from 
ATCC and the cell line was certified by this repository, therefore we are confident 
that it is indeed a colon cancer cell line. The C57BL/6 mouse pancreatic adenocarci- 
noma Pan02 was purchased from the National Cancer Institute Tumour Repository 
(DTP/DCTD, Frederick National Laboratory for Cancer Research). For in vitro edu- 
cation of human lung fibroblasts WI-38 (ATCC), human bronchial epithelial cells 
HBEpC (PromoCell), and human Kupffer cells (Life Technologies), cells were main- 
tained in culture for 14 days, with media containing 0, 5 or 10 ug ml! of exosomes, 
replenished every other day. Kupffer cells were cultured in RPMI and WI-38 cells 
were cultured in alpha-MEM, both supplemented with 10% exosome-depleted 
FBS (Gibco, Thermo Fisher Scientific) and penicillin-streptomycin. HBEpC cells 
were cultured in airway epithelial cell growth medium (PromoCell). All cells were 
maintained in a humidified incubator with 5% CO, at 37°C. FBS was depleted of 
bovine exosomes by ultracentrifugation at 100,000g for 70 min. All cells lines were 
routinely tested for mycoplasma and were found to be negative. 

Exosome purification, characterization and analyses. Exosomes were purified by 
sequential centrifugation as previously described”. In brief, cells were removed from 
3-4-day cell culture supernatant by centrifugation at 500g for 10 min to remove any 
cell contamination. To remove any possible apoptotic bodies and large cell debris, 
the supernatants were then spun at 12,000g for 20 min. Finally, exosomes were col- 
lected by spinning at 100,000g for 70 min. Exosomes were washed in 20 ml PBS and 
pelleted again by ultracentrifugation (Beckman 70Ti rotor). Exosome preparations 
were verified by electron microscopy. Exosome size and particle number were ana- 
lysed using the LM10 or DS500 nanoparticle characterization system (NanoSight, 
Malvern Instruments) equipped with a blue laser (405 nm). Normal mammary fat 
pad tissue-derived exosomes were obtained by culturing five mammary fat pads 
isolated from healthy 4-6-week-old C57BL/6 mice in 3 ml of FBS-free RPMI for 
12h. The final exosome pellet was resuspended in PBS and protein concentration 
was measured by BCA (Pierce, Thermo Fisher Scientific). 

Proteomics analysis. Mass spectrometry analyses of exosomes were performed at 
the Rockefeller University Proteomics Resource Center using 20 ug of exosomal 
protein. Samples were denatured using 8 M urea, reduced using 10 mM dithioth- 
reitol (DTT), and alkylated using 100 mM iodoacetamide. This was followed by 
proteolytic digestion with endoproteinase LysC (Wako Chemicals) overnight at 
room temperature, and subsequent digestion with trypsin (Promega) for 5h at 
37°C. The digestion was quenched with formic acid and resulting peptide mixtures 
were desalted using in-house made C18 Empore (3M) StAGE tips’”. Samples were 
dried and solubilized in the sample loading buffer containing 2% acetonitrile and 
2% formic acid. Approximately 3-5 ug of each sample was analysed by reversed 
phase nano-liquid chromatography—tandem mass spectrometry (LC-MS/MS) 
(Ultimate 3000 coupled to QExactive, Thermo Scientific). After loading onto the 
C18 trap column (5 um beads, Thermo Scientific) at a flow rate of 3 ul min”, 
peptides were separated using a 75-\1m inner diameter C18 column (3 um beads, 
Nikkyo Technos Co.) at a flow rate of 200 nl min‘, with a gradient increasing 
from 5% buffer B (0.1% formic acid in acetonitrile)/95% buffer A (0.1% formic 
acid) to 40% buffer B/60% buffer A, over 140 min. All LC-MS/MS experiments 
were performed in data-dependent mode. Precursor mass spectra were recorded 
in a 300-1,400 m/z mass range at 70,000 resolution, and 17,500 resolution for 
fragment ions (lowest mass: m/z 100). Data were recorded in profile mode. Up to 
20 precursors per cycle were selected for fragmentation and dynamic exclusion 
was set to 45 s. Normalized collision energy was set to 27. 

Semi-quantitative data analysis. MS/MS spectra were extracted and searched 
against Uniprot complete human or mouse proteome databases (January 2013) 
concatenated with common contaminants” using Proteome Discoverer 1.4 
(Thermo Scientific) and Mascot 2.4 (Matrix Science). All cysteines were consid- 
ered alkylated with acetamide. N-terminal glutamate to pyroglutamate conver- 
sion, oxidation of methionine, and protein N-terminal acetylation were allowed 
as variable modifications. Data were first searched using fully tryptic constraints. 
Matched peptides were filtered using a Percolator”? based 1% false discovery rate 
(FDR). Spectra not being matched at a FDR of 1% or better were re-searched 
allowing for semi-tryptic peptides. The average area of the three most abundant 


peptides for a matched protein” was used to gauge protein amounts within and 
between samples. 

Label-free quantitative mass spectrometry. LC-MS/MS data from three technical 

replicates of six organ-tropic samples were analysed using MaxQuant (version 
1.5.0.30) and Perseus software (version 1.5.0.9)5!, searching against a Uniprot 
human database (July 2014). Oxidation of methionine and protein N-terminal 
acetylation were allowed as variable modifications, and cysteine carbamidomethyl 
was set as a fixed modification. Two missed cleavages were allowed for specificity: 
trypsin/P. The ‘match between runs’ option was enabled. FDR values at the protein 
and peptide level were set to 1%. Protein abundance is expressed as LFQ values. 
Only proteins quantified in at least two out of three replicates in at least one group 
were retained, and missing values were imputed. A multiple sample ANOVA test 
was performed and corrected for multiple hypotheses testing using a permuta- 
tion-based FDR threshold of 0.05. 
Exosome treatment and labelling. To assess lung, liver and bone exosome dis- 
tribution, exosomes were injected via the retro-orbital venous sinus, the tail vein 
or intracardially. Exosome distribution patterns were consistent regardless of the 
route of injection. For brain distribution, exosomes were only observed in the 
brain after intracardiac injection. For 24-h exosome treatments, 10 tg of total 
exosomal protein were injected via the retro-orbital venous sinus, the tail vein, 
or intracardially in a total volume of 100 ul PBS. For exosome-tracking purposes, 
purified exosomes were fluorescently labelled using PKH67 (green) or PKH26 
(red) membrane dye (Sigma-Aldrich) or FM1-43FX dye (Life Technologies) for 
the photo-conversion experiment. Labelled exosomes were washed in 20 ml of PBS, 
collected by ultracentrifugation and resuspended in PBS. When performing pep- 
tide blocking experiments, exosomes were incubated with 0.06 1M RGD or HYD-1 
(peptide sequence: KIKMVISWKG) peptides for 30 min at 37°C before exosome 
injection. An average of five random fields was counted per sample at 20x magnifi- 
cation, and representative pictures were taken at 40x magnification. For education 
experiments, mice received 10 ug of exosomes retro-orbitally every other day for 
3 weeks. To measure exosome uptake by specific cell types, labelled exosomes 
were injected 24h before tissue collection and tissues were analysed for exosome- 
positive cells by immunofluorescence. Pictures were taken at 60 x magnifica- 
tion. For in vitro uptake assays, the membrane of WI-38 cells was labelled with 
PKH67 dye while 4175-LuT exosomes were labelled with PKH26 dye. Exosomes 
(10ug ml‘) were first incubated with PBS or HYD-1 peptide for 30 min at 37°C, 
followed by an incubation for 1 h with WI-38 cells at 37 °C. Excess exosomes 
were washed off and pictures were taken by Nikon confocal microscope (Eclipse 
TE2000U). The amount of exosomes localizing to the lung was analysed by 
immunofluorescence or using the Odyssey imaging system (LI-COR Biosciences). 
In brief, NIR dye-labelled exosomes were injected 24h before tissue collection and 
tissues were analysed for exosome-positive areas. Whole-lung images were ana- 
lysed using image J software, quantifying red fluorescence area in arbitrary units. 
Photoconversion and electron microscopy processing. Cryostat sections prepared 
at a 15-um thickness were placed on glass slides and re-fixed in 0.075 M sodium 
cacodylate, pH 7.4, containing 2.5% glutaraldehyde. For photoconversion, slides 
were washed twice in 0.1 M sodium cacodylate buffer, pH 7.4. Autofluorescence 
was quenched using 100mM NH, (Cl in cacodylate buffer for 45 min. On the basis 
of optimization experiments, sections were photoconverted for 2 h by incubation 
in 5.4mg ml ! 3,3/-diaminobenzidine in 0.1 M sodium cacodylate buffer, pH 7.4, 
and exposure to the light of an Intensilight C-HGFI 130-W mercury lamp and a 
4x/0.1 NA objective (Nikon Inverted Microscope Eclipse Ti). 

For electron microscopy processing, sections were post-fixed in 1% osmium 
tetroxide buffer for 15 min on ice. After washing with water, slides were placed in 
1% aqueous uranyl acetate for 30 min. Sections were washed with water, dehydrated 
in a graded series of ethanol concentrations and subsequently in acetone for 10 min 
at room temperature. Samples were embedded in Eponate. Serial sections were cut 
at 70 nm in thickness and transferred to formvar-coated slot grids and imaged on 
a JEOL 100CX at 80 kV with an AMT XR41 digital imaging system. 

Gene expression analysis. Cell lines were analysed for specific genes using 
pre-designed TaqMan assays (Applied Biosystems). In brief, RNA was extracted 
from tissues or cells using the RNeasy kit (Qiagen), and reverse transcribed using 
Superscript Vilo (Life Technologies). RT-PCR was performed on a 7500 Fast Real 
Time PCR System (Applied Biosystems), using TaqMan Universal PCR Master 
Mix (Applied Biosystems). Relative expression was normalized to B-actin levels. 

Knockdown and overexpression cell preparation. For shRNA-mediated knock- 
down of ITGf, and ITGB., specific interfering lentiviral vectors containing GFP 
reporter and puromycin resistance gene cassettes were used. In brief, oligonucleo- 
tide 5'-CCGGGAGGGTGTCATCACCATTGAACTCGAGTTCAATGGTGATG 
ACACCCTCTTTTTG-3’ targeting the 5’-GAGGGTGTCATCACCATTGAA-3’ 
sequence in the human ITGB4 gene (EntrezGene ID: 3691) or oligonucleotide 
5!-CCGGAGCTTGTTGTCCCAATGAAATCTCGAGATTTCATTGGGACAAC 
AAGCTTTTTTG-3’ targeting the 5’-AGCTTGTTGTCCCAATGAAAT-3’ 
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sequence in the human ITGB5 gene (EntrezGene ID: 3693) were cloned into the 
pLKO.1 vector. As a control, we used the empty pLKO.1 vector. For retrovirus 
production for integrin overexpression, the pWZL and pBabe vectors systems were 
used. pWZL-hygro-ITGB, and pBabe-puro-ITGB, were provided by F. Giancotti. 
Lentiviral and retroviral particles were packaged using 293T cells. Infected tar- 
get cells were selected using 500 ug ml hygromycin B or 2g ml | puromycin 
(Invitrogen). 

Flow cytometry analysis. Bone marrow was prepared for flow cytometry as pre- 
viously described’. For analysis of lung, tissues were minced and then digested at 
37°C for 20 min with an enzyme cocktail (collagenase A, dispase and DNasel, Roche 
Applied Science). Single-cell suspensions were prepared by filtering through a 70-um 
strainer and passing through an 18G syringe. Lung fibroblasts were identified by flow 
cytometry using an anti-mouse rabbit polyclonal $100A4 (1:50, Abcam; ab27957), 
or SPC (1:100, Santa Cruz; FL-197), revealed by Alexa Fluor 568-conjugated goat 
anti-rabbit secondary (A-11011, Life Technologies, 1:400). For liver, tissues were 
mechanically dissociated, and single-cell suspensions were filtered through a 40-um 
strainer. Allophycocyanin-conjugated F4/80 (1:100, eBioscience; clone BM8) was used 
to identify liver macrophages by flow cytometry. Cell fluorescence indicating fluores- 
cently labelled exosome uptake was analysed using a FACSCalibur or a FACSCanto 
(Beckton Dickinson). FACS data was analysed with FlowJo software (TreeStar Inc.). 
Migration assay. Twenty-thousand cells were plated in 24-well transwell plates 
with inserts (8-um pore size, Corning) and were incubated at 37°C for 6h. Cell 
inserts were fixed with 4% paraformaldehyde (PFA) for 10 min, followed by PBS 
wash and haematoxylin staining to allow visualization and counting. Nine random 
fields were counted per well at 20x magnification and the average number of 
migrated cells per field was calculated. 

Human studies. Human peripheral blood samples were obtained from control 
healthy subjects and cancer patients with lung or liver metastasis, or from patients 
without distant metastasis at Weill Cornell Medical College, University Medical 
Center Hamburg-Eppendorf, Oslo University Hospital, Memorial Sloan Kettering 
Cancer Center and University of Nebraska Medical Center, all pathologically 
confirmed. All individuals provided informed consent for blood donation on 
approved institutional protocols (WCMC IRB 0604008488 (DL), MSKCC IRB 
12-137A (JB)). Plasma or serum exosomes were isolated as previously described’. 
ITG6, and ITGa, levels in exosomes were measured by ELISA (ABIN417641 and 
ABIN417609 from Antibodies Online, and LS-F7188 from LifeSpan Biosciences), 
using 2 ug of exosomes per 100 ul of sample diluent, in duplicate reactions, accord- 
ing to the manufacturer's instructions. 

Mouse studies. All mouse work was performed in accordance with institutional, 
IACUC and AAALAS guidelines, by the animal protocol 0709-666A. All animals 
were monitored for abnormal tissue growth or ill effects according to AAALAS 
guidelines and euthanized if excessive deterioration of animal health was observed. 
No statistical method was used to pre-determine sample size. No method of rand- 
omization was used to allocate animals to experimental groups. The investigators 
were not blinded to allocation during experiments and outcome assessment. Mice 
that died before the predetermined end of the experiment were excluded from the 
analysis. In none of the experiments did tumours exceed the maximum volume 
allowed according to our IACUC protocol, specifically 2 cm’. For exosome localiza- 
tion, education and tumour implantation experiments for mouse cell lines, 6-week- 
old C57BL/6 Mus musculus females purchased from Jackson labs were used. For 
exosome localization, education and tumour implantation experiments for human 
cell lines, 6-8-week-old NCr nude (NCRNU-F sp./sp.) females purchased from 
Taconic were used. For lung metastasis studies using organotropic lines, 6-8-week- 
old nude female mice were pre-educated with exosomes for 3 weeks followed by 
tail vein injection of 2 x 10° or intracardiac injection of 1 x 10° luciferase-positive 
cancer cells resuspended in 100 ul PBS. Four weeks after intracardiac injection and 
eight weeks after tail vein injection, lung metastasis was measured using the IVIS 
200 bioluminescence imaging system (Xenogen, Caliper Life Sciences), and tissues 
were cut in 6-um sections and stained with haematoxylin and eosin for histology. 
To analyse the role of exosome education in tumour metastasis, 6-8-week-old 
C57BL/6 female mice pre-educated with pancreatic cancer-derived exosomes were 
injected intrasplenically with 1 x 10° Pan02 mCherry cells resuspended in 30 ul of 
Matrigel (Corning). One or twenty-one days later, mice were euthanized, and livers 
were analysed for metastatic lesions by measuring liver weight. 

To follow the levels of tumour-derived exosomes in plasma of tumour-bearing 
mice, 1 x 10° 4175 lung-tropic cells were injected in the mammary fat pad of nude 
mice. Mouse blood (250 ul) was drawn from the retro-orbital sinus when tumour 
size was over 800 mm‘°, followed by tumour resection. One week after the tumour 
was resected, mice were analysed by bioluminescence IVIS imaging for luciferase 
activity and separated into two groups: recurrence/tumour-free and recurrent 
tumours. Mouse blood was drawn and the plasma of mice within the same group 
was pooled for exosome isolation. Western blot analysis with anti-human ITGB, 
antibodies was used to detect tumour-derived exosomes. 
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To assess exosome-induced vascular leakiness, 10 ug of total exosome pro- 
tein were injected by retro-orbital injection. Then 20h after exosome treatment, 
mice were injected with 2 mg of Texas Red-lysine fixable dextran 70,000 MW 
(Invitrogen) via retro-orbital injection. One hour after dextran injection, mice were 
euthanized and perfused with PBS. Lungs were dissected and fixed in a mix of 2% 
PFA and 20% sucrose overnight, then embedded in Tissue-tek O.C.T. embedding 
compound (Electron Microscopy Sciences) and frozen in a dry-ice/ethanol bath. 
O.C.T. blocks were sectioned and stained for DAPI, pictures were taken using a 
Nikon confocal microscope (Eclipse TE2000U). Images were analysed using image 
J software, quantifying red fluorescence area in arbitrary units. 

Tissue processing and immunofluorescence. For histological analysis, tissues 
were dissected and fixed in a mix of 2% PFA and 20% sucrose in PBS overnight, 
then embedded in Tissue-tek O.C.T. embedding compound. Blocks were frozen in 
a dry-ice/ethanol bath. For immunofluorescence, 6 1m O.C.T tissue cryosections 
were stained with antibodies against F4/80 (1:100, eBioscience; BM8), fibronectin 
(1:50, Santa Cruz; IST-9), S$100A4 (1:100, Abcam; ab27957), SPC (1:100, Santa 
Cruz; FL-197), laminin (1:50, abcam; ab11575), CD31 (1:100, Santa Cruz; MEC 
13.3), EpCAM (1:50, Santa Cruz; HEA125). Secondary antibodies conjugated 
to Alexa Fluor 488 or 549 were used (A-11001 and A-11007, Life Technologies). 
Fluorescent images were obtained using a Nikon confocal microscope (Eclipse 
TE2000U) and analysed using Nikon software (EZ-C1 3.6). 

Western blot analysis. Exosomes or cells were lysed with RIPA buffer containing 
a complete protease inhibitor tablet (Roche). Lysates were cleared by centrifu- 
gation at 14,000g for 20 min. Supernatant fractions were used for western blot. 
Samples were separated on a Novex 4-12% Bis-Tris Plus Gel (Life Technologies), 
and transferred onto a PVDF membrane (Millipore). Membranes were processed 
for Ponceau red staining followed by 1h blocking and primary antibody incu- 
bation. The antibodies against the following proteins were used for western blot 
analysis: ITGB, (1:1,000, Cell Signaling; 4706), ITGB, (1:500, Cell Signaling; 
4707), ITGag (1:1,000, Cell Signaling; 3750), ITGa, (1:10,000, abcam; ab133557), 
ITGa (1:1,000, abcam; ab190731), ITGa,, (1:500, abcam; ab117611), ITGB; 
(1:500, Cell Signaling; 4708), ITGB, (1:500, Millipore; AB2984) Alix (1:1,000, 
Cell Signaling; 3A9), and GAPDH (1:10,000, Cell Signaling; 14C10). Anti-rabbit 
IgG, horseradish peroxidase (HRP)-linked antibody (1:3,000, Cell Signaling; 
7074) and anti-mouse IgG, HRP-linked antibody (1:3,000, Cell Signaling; 7076) 
were used as secondary antibodies. 

In situ protein expression analysis (in-cell western assay, LI-COR). Cells were 
plated in a 96-well plate and treated with 10 ug ml | exosomes for 2h and then 
processed according to the protocol provided by the manufacturer. In brief, cells 
were fixed with 4% PFA and washed with 0.1% TritonX-100/PBS. Cells were then 
blocked using Odyssey blocking buffer for 1h and stained overnight at 4°C with pri- 
mary antibody in Odyssey blocking buffer containing 0.1% Tween-20. The next day 
cells were washed again and incubated with LI-COR secondary antibodies for 1h 
at room temperature followed by fluorescent imaging using Odyssey. Antibodies 
against the following proteins were used: Src (1:100, Cell Signaling; 2109), p-Sre 
(1:100, Cell Signaling; 2101), AKT (1:100, Cell Signaling; 9272), p-AKT (1:100, Cell 
Signaling; 9271), p38 (1:100, Cell Signaling; 9212), p-p38 (1:100, Cell Signaling; 
9211), NF-«B (1:100, Cell Signaling; 3034), p-NF-«B (1:100, Cell Signaling; 3033), 
NFAT (1:100, Thermo Scientific; PA1-023), ILK (1:100, abcam; ab52480), FAK 
(1:100, abcam; ab40794) and GAPDH (1:100, Cell Signaling; 14C10). IRDye 
800CW anti-rabbit IgG (1:800, LI-COR) were used as secondary antibodies. 
Statistical analysis. Error bars in graphical data represent mean + s.e.m. Mouse 
experiments were performed in duplicate or triplicate, using 3-6 mice per treatment 
group. Statistical significance was determined using a two-tailed Student's t-test and 
one-way ANOVA, in which P values of P< 0.05 were considered statistically sig- 
nificant. Variance was similar between the groups that were statistically compared. 
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Extended Data Figure 1 | Characterization of organotropic exosome 
properties and biodistribution. a, Human cancer exosome biodistribution 
in lung and liver. Exosomes (10 ug) derived from each cell line were labelled 
with lipophilic PKH26 dye (red) and injected retro-orbitally into nude mice 
24h before culling. Top, representative NIR whole-lung image by Odyssey 
imaging (n= 3). Middle and bottom, represent exosome biodistribution 

in the lung and liver as determined by immunofluorescence microscopy. 
Arrows indicate exosome foci (n = 3, three independent experiments). 

b, Biodistribution of exosomes isolated from mouse cell lines E0771 and 
Pan02. Mouse exosome biodistribution in the lung and liver was determined 
by immunofluorescence microscopy. Exosomes (10 ug) derived from each 
cell line were labelled with lipophilic PKH26 dye (red) and injected retro- 
orbitally into nude mice 24h before culling. Top, lung at 40x magnification. 
Bottom, liver at 40x magnification. Arrows indicate exosome foci. Graph 
represents the quantification of exosome distribution by counting exosome- 
positive cells. An average of five random fields per sample were counted 

at 20x magnification (three independent experiments, each with n = 3). 
** P< 0.01 by two-tailed Student's t-test. c, Analysis of organotropic cell- 
derived exosomes. MDA-MB-231 organotropic cell-line-derived exosomes 


were analysed for size distribution by NanoSight and phenotype (purity 
and shape) by electron microscopy; black arrows indicate representative 
exosomes. Technical triplicates were analysed, at least 10 images per 
sample. d, Flow cytometric analysis of exosome’ cells in lung. Exosomes 
(10 ug) derived from MDA-MB-231 organotropic cell lines were labelled 
with lipophilic PKH67 dye (green) and injected retro-orbitally into nude 
mice 24h before culling. FITC-channel-positive cells were acquired on a 
FACS Calibur, and the percentage of exosome-positive cells was quantified 
(representing data pooled from two independent experiments, a total of 
n= 12).***P < 0.001 by one-way ANOVA. e, Flow cytometric analysis 

of exosome-positive cells in the bone marrow. Exosomes (10 1m) derived 
from MDA-MB-231 organotropic cell lines were labelled with lipophilic 
PKH67 dye (green) and injected retro-orbitally into nude mice 24h before 
culling. FITC-channel-positive cells were acquired on a FACS Calibur, and 
the percentage of exosome-positive cells was quantified (representative data 
pooled from two independent experiments, a total of n= 6). ** P< 0.01 and 
*P < 0.05 by one-way ANOVA for the 831-BrT to 1833-BoT and 4175-LuT 
comparisons, respectively. Data are mean + s.e.m. Scale bars, 5 mm (a, top), 
50 um (a, middle and bottom, b) and 100 nm (c). 
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Extended Data Figure 4 | Functional characterization of organotropic 
exosomes. a, Quantification of organotropic exosome uptake by target 

cells in vivo. Top graph, flow cytometric quantification of the frequency 

of 4175-LuT exosome-positive fibroblasts and epithelial cells (n= 4). 

Left bottom graph, flow cytometric quantification of the frequency of 
BxPC-3 exosome-positive macrophages (n= 3). Right bottom graph, 
quantification of the frequency of 831-BrT exosome-positive endothelial 
cells by immunofluorescence microscopy (n= 5). b, Organotropic cell-line- 
derived exosomes induce vascular leakiness in the lung. Leakiness in the 


lung 24h after retro-orbital injection of 10 ug of normal mammary fat pad or 
MDA-MB-231 organotropic cell-line-derived exosomes was quantified by 
imaging the presence of fluorescent dextran (red) outside of blood vessels, 

in the lung parenchyma. Left top panel, 40x magnification of representative 
lung image after PBS injection. Left bottom panel, representative lung image 
after 4175-LuT exosome injection. Scale bar, 50 um. Right graph depicts 

the quantification of five random areas at 20x magnification in arbitrary 
units (data representative of two independent experiments; n= 3). Data are 
mean + s.e.m. *P < 0.05; **P < 0.01 by one-way ANOVA. 
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Extended Data Figure 5 | Exosome co-localization with specific cell types 
within target tissues. a, Immunofluorescence analysis of resident cells in 
lung, liver and brain after labelled exosome injection. Analysis of exosome 
(red) co-staining with markers (green) for tissue-specific stromal cell types. 
Top, representative images of immunofluorescence microscopy of 4175-LuT 
exosome co-staining with F4/80, CD31 and EpCAM. Middle, liver sections 
from mice injected with BxPC-3-LiT-derived exosomes were co-stained 
with CD31, $100A4 and EpCAM. Bottom, brain sections from mice injected 
with 831-BrT exosome were co-stained with F4/80, S100A4 and EpCAM 


831+-BrT 


Laminin!S100A4 


(n= 3 per experiment for two independent experiments). b, Exosome 
biodistribution and co-localization with extracellular matrix proteins. 

Left top, representative immunofluorescence microscopy images of lung 
tissue, depicting 4175-LuT exosome (red) co-staining with laminin (green). 
Right top, laminin (green) co-staining with $100A4 (red). Left bottom, 
representative immunofluorescence microscopy of liver tissue co-stained 
for fibronectin (green) and BxPC-3-LiT exosomes (red). Right bottom, 
fibronectin (green) co-staining with F4/80 (red) (n = 3, two independent 
experiments). Scale bars, 30 um. 
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Extended Data Figure 6 | ITGs functionally regulate organotropic 
exosome uptake and exosome-mediated metastasis. a, Representative 
western blot analysis of integrin expression in 4175-LuT and 4175B,KD 

cells and exosomes (representative of three independent experiments). For 
western blot source data, see Supplementary Fig. 11. b, In vitro uptake of 
4175-LuT exosomes by WI-38 lung fibroblasts. The WI-38 cell membrane 
was labelled with PKH67 green dye and 4175-LuT exosomes were labelled 
with PKH26 red dye. Exosomes (10 tg ml *) were first incubated with PBS or 
HYD-1 peptide for 30 min at 37 °C, followed by 1-h incubation with WI-38 
cells at 37 °C. Excess exosomes were washed and cells were imaged (n= 4 

for two independent experiments). c, Representative western blot of ITGB, 
expression in exosomes isolated from wild-type or ITGB,-overexpressing 
1833-BoT cells (representative of two independent experiments). For western 
blot source data, see Supplementary Fig. 1m. d, Representative haematoxylin/ 
eosin staining of lungs from Fig. 3e. Arrows indicate lung metastasis; n = 6, 
data representative of two independent experiments. e, Representative 
western blot analysis of integrin expression in BxPC-3-LiT and BxPC-38;KD 
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cells and exosomes. For western blot source data, see Supplementary Fig. In. 
f, Immunofluorescence analysis of BxPC-3-LiT control and BxPC-3B,KD- 
derived exosome biodistribution in the liver. Exosomes (10 1g) isolated from 
each cell line were labelled with lipophilic PKH26 dye (red) and injected 
retro-orbitally into nude mice 24h before culling. Left, 40x magnification. 
Arrows indicate exosome foci. Scale bar, 50 um. Right, quantification of 
exosome distribution by exosome-positive cells. An average of five random 
fields were counted at 20x magnification (data representative of two 
independent experiments; n = 3). g, Flow cytometry analysis of exosome- 
positive cells in the liver 24h after exosome injection. Labelled BxPC-3-LiT 
exosomes (5 1g) per mouse were incubated with PBS, RGD, HYD-1 or 
ITGa,f,, antibody for 30 min at 37°C before retro-orbital injection into nude 
mice. Livers were collected and analysed for exosome-positive cells by flow 
cytometry 24h after injection (n = 4, except for the ITGa,B, antibody group, 
in which n=5). Scale bars, 10 um (b), 500 um (d) and 500 um (f). **P< 0.01, 
*** P< 0.001 by two-tailed Student's t-test (f) and one-way ANOVA (g). Data 
are mean + s.e.m. 
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Extended Data Figure 7 | Functional contribution of exosomes to 
metastasis. a, Microscopic analysis of exosome-positive cells in the livers 

of mice injected with liver metastatic Pan02-LiT-derived exosomes. Before 
injection, Pan02-LiT exosomes were pre-incubated with RGD peptide for 
30 min at 37°C. Pan02-LiT exosomes (101g) were labelled with lipophilic 
PKH67 green dye and injected retro-orbitally into C57BL/6 mice 24h before 
culling. Livers were digested and exosome-positive cells were quantified 

by flow cytometry (n= 3). b, Analysis of Pan02-LiT liver metastasis after 

3 weeks of continuous treatment with PBS, Pan02-LiT-derived exosomes, 
or Pan02-LiT-derived exosomes pre-incubated with RGD peptide for 

30 min at 37°C. Pan02-LiT cells were injected intrasplenically. Mice were 
injected retro-orbitally with 5 ug exosome every other day for 3 weeks. Top, 
representative liver images showing metastasis taken at culling. Bottom, 
liver weight quantification ( = 4 except for the control and peptide 

group for which n = 3 of one experiment). c, Functional analysis of lung 
fibroblasts educated with 4175-LuT-derived exosomes. Proliferation of lung 


fibroblasts educated with exosomes every other day for 2 weeks. Three days 
after cells were plated at equal density, cell numbers were counted using a 
haemocytometer (n= 3; three independent experiments). d, Migration of 
lung fibroblasts educated with exosomes every other day for 2 weeks was 
measured as follows. Fibroblasts were plated in 24-well transwell chamber 
inserts, and after 6 h the number of cells that migrated was counted 

using haematoxylin staining. Nine random fields were counted at 20x 
magnification and the average number of cells per field was calculated (total 
of n= 4 from two independent experiments). e, Representative image of the 
lung stained for $100A4. Mice were treated every other day with PBS, 4175- 
LuT or 41758,KD exosomes for 3 weeks. Scale bar, 50 um; n = 4 mice. f, In 
situ (in-cell western) protein expression analysis of WI-38 fibroblasts treated 
with PBS, 4175-LuT or 4175ITGB ,KD exosomes. Relative expression levels 
of Src and phosphorylated (p-) Src (n = 3, three independent experiments). 
Data are mean + s.e.m. *P < 0.05; **P< 0.01; ***P < 0.001 by two-tailed 
Student's t-test (a, c, d) and one-way ANOVA (b, f). 
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Extended Data Figure 8 | Exosomal integrin expression as a potential 
metastatic site biomarker. a, Exosomal ITG, levels in the plasma of 
mice bearing orthotopic 4175-LuT tumours, as a function of tumour 
progression. Blood plasma was collected for exosome isolation 6 weeks 
after intra-mammary fat pad tumour injection, then again 1 week after 
tumour resection, from mice that were deemed to be either free of tumour 
or presenting with recurring tumours based on IVIS bioluminescence 
imaging (n= 5 were pooled for each group, based on one experiment). For 
western blot source data, see Supplementary Fig. lo. b, Exosomal ITGB, in 
healthy control subjects (Ctrl) (n = 13); patients with breast cancer (BrCa) 
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and no metastasis (n = 3), liver metastasis (n= 1), or lung metastasis (n = 3); 
patients with rhabdomyosarcoma (RMS) and no metastasis (n = 1) or 

lung metastasis (n= 3); patients with pancreatic cancer (PDAC) with liver 
metastasis (1 = 14) and lung metastasis (n = 3); and patients with melanoma 
(Mel) with lung metastasis (n = 2). c, Exosomal ITGay in healthy control 
subjects (n = 13); patients with rhabdomyosarcoma and no metastasis (n = 1) 
or lung metastasis (n = 3); patients with breast cancer and lung metastasis 
(n= 3) or liver metastasis (n = 1); and patients with pancreatic cancer and 
liver metastasis (n = 15). Data are mean + s.e.m. *P < 0.05; *** P< 0.001 by 
one-way ANOVA. 
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Extended Data Table 1 | Integrin expression in human exosomes in multiple organotropic tumour models 
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Proteomic analysis of integrins in exosomes. ‘+’ indicates positive for integrin expression by qualitative mass spectrometry. 
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Extended Data Table 2 | Integrin expression in human and mouse cell-line-derived exosomes 
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Gating machinery of InsP3R channels 
revealed by electron cryomicroscopy 


Guizhen Fan', Matthew L. Baker”, Zhao Wang’, Mariah R. Baker', Pavel A. Sinyagovskiy', Wah Chiu’, 


Steven J. Ludtke? & Irina I. Serysheva! 


Inositol-1,4,5-trisphosphate receptors (InsP3Rs) are ubiquitous ion channels responsible for cytosolic Ca** signalling 
and essential for a broad array of cellular processes ranging from contraction to secretion, and from proliferation to cell 
death. Despite decades of research on InsP3Rs, a mechanistic understanding of their structure—-function relationship is 
lacking. Here we present the first, to our knowledge, near-atomic (4.7 A) resolution electron cryomicroscopy structure 
of the tetrameric mammalian type 1 InsP3R channel in its apo-state. At this resolution, we are able to trace 
unambiguously ~85% of the protein backbone, allowing us to identify the structural elements involved in gating and 
modulation of this 1.3-megadalton channel. Although the central Ca**-conduction pathway is similar to other ion 
channels, including the closely related ryanodine receptor, the cytosolic carboxy termini are uniquely arranged in a 
left-handed a-helical bundle, directly interacting with the amino-terminal domains of adjacent subunits. This 
configuration suggests a molecular mechanism for allosteric regulation of channel gating by intracellular signals. 


InsP3Rs belong to a superfamily of tetrameric cation channels that 
includes functionally distinct groups of ion channels (for example, 
K*, Na?", Ca?", transient receptor potential and cyclic nucleotide- 
gated channels). A common architectural feature of these channels is 
their central ion-permeation pore consisting of four membrane-span- 
ning subunits or domains. However, these channels are quite different 
with respect to their activation and ability to respond to extracellular 
and intracellular stimuli, defining their particular roles in a wide range 
of cellular processes. Localized within the membranes of intracellular 
Ca** stores such as the endoplasmic/sarcoplasmic reticulum, InsP3R 
channels have crucial roles in a variety of physiological functions, 
including gene transcription, fertilization, hormone secretion, meta- 
bolic regulation, immune responses, apoptosis, learning and memory. 
Their malfunction is associated with abnormal intracellular Ca7* 
levels linked to pathological conditions in humans, such as cardiac 
hypertrophy, heart failure, Alzheimer, Parkinson and Huntington 
diseases, cancer and stroke. 

Type 1 InsP3R (InsP3R1) exemplifies the family of InsP3-gated 
Ca?* release channels, which is comprised of three homologous iso- 
forms (types 1-3). InsP3R1 is the predominant Ca** release channel 
in cerebellar Purkinje cells and best characterized member of the 
family. Despite its profound importance in physiology, a mechanistic 
basis for InsP3R1 function has remained elusive, largely owing to the 
lack of a detailed architecture of the intact InsP3R channel. 

The quaternary structure of the entire InsP3R1 assembly was con- 
troversial’ until our previous intermediate resolution electron cryo- 
microscopy (cryo-EM) structure**. While this structure is critical in 
providing the foundation for further structure-function studies, the 
resolution was insufficient to reveal the mechanistic features under- 
lying the channel function. Furthermore, X-ray crystal structures for 
portions of InsP3R1 are available but limited to small soluble pieces 
(~15% of the 314-kilodalton protein)*’. Knowledge of the full-length 
InsP3R architecture with atomistic details is paramount to under- 
standing the molecular mechanism underlying channel function in 
both healthy and disease states. Here we present a cryo-EM structure 


and model of the entire InsP3R1 channel in its ligand-free state deter- 
mined at an overall 4.7 A resolution by single-particle cryo-EM. The 
structure reveals distinct features that allow us to infer its gating 
mechanism. 


Overall architecture of InsP3R1 


In this work, InsP3R1 from rat cerebellum was detergent-purified 
without adding InsP; or other channel-specific ligands, as previously 
described’. Before vitrification, InsP3R1 channel particles were stabi- 
lized by depletion of Ca?*, conditions that favour a closed state of the 
channel (Methods). The final map was reconstructed to 4.7 A resolu- 
tion (Extended Data Figs 1 and 2). The shape and overall dimensions 
of the tetrameric InsP3R1 3D density map are in excellent agreement 
with our previously published cryo-EM structure’: four subunits are 
arranged around a central channel axis forming two major regions, 
the bulky cytosolic region connected via ‘stalk’ densities to the trans- 
membrane region (Extended Data Fig. 3 and Supplementary Video 1). 

Resolution and resolvability of structural features are different in 
various parts of the map (Extended Data Fig. 2c) as suggested in 
our previous studies*’. These differences are probably, at least in 
part, due to genuine flexibility associated with particular domains. 
Additionally, the potential presence of InsP3R1 splice variants may 
introduce some degree of compositional heterogeneity within the 
purified samples*"°. Thus, it is conceivable that these regions com- 
prising alternatively spliced sequences represent some of the relatively 
poorly resolved regions of the map (Extended Data Figs 4a and 5). 
However, the transmembrane helices are well resolved in the map 
with many bulky side chains visible in the density map (Extended 
Data Fig. 6). Despite the observed variations in resolvability, we were 
able to model the backbone topology for 2,327 of 2,750 amino acids in 
the full-length InsP;R1 protein (Fig. 1). 


Subunit folds and tetrameric assembly 


The individual InsP;R1 subunit is made up of ten domains (Fig. 1b, c), 
arranged within the tetrameric channel assembly around a central 


1Department of Biochemistry and Molecular Biology, Structural Biology Imaging Center, The University of Texas Medical School at Houston, 6431 Fannin Street, Houston, Texas 77030, USA. National 
Center for Macromolecular Imaging, Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA. 
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Figure 1 | Overview of InsP3R1 structure. a, Structure of InsP3R1 visualized 
in two orthogonal orientations: view from cytosol and side view along the 
membrane plane. Subunits are colour-coded. b, Linear representation of 
InsP3R1 structural domains (GI accession 17380349): domains are annotated 


four-fold axis to fulfill specific functions, including channel stability, 
ion transport, ligand binding and regulation. In our model, ~90% of 
the protein sequence extends beyond the membrane into the cytosol 
including most of the N-terminal protein sequence and the 
C-terminal tail (Extended Data Fig. 5). The remaining portion 
encompasses the transmembrane and lumenal domains. 

At the amino terminus of each subunit, two contiguous f-trefoil 
domains ($-TF1 and B-TEF2; residues 1-436) form apical densities 
around the central four-fold axis of the cytosolic region (Fig. 1b, c, 
Extended Data Fig. 4c and Supplementary Video 2). After B-TF2, 
there is a clear o-helical pattern (residues 437-2192) forming three 
consecutive armadillo solenoid folds (ARM1-—ARM3), with an «-hel- 
ical domain between ARM1 and ARM2. After ARM3, the subunit 
extends into the ‘intervening lateral’ domain (ILD, residues 2193- 
2272) that contains two anti-parallel B-strands followed by a helix- 
turn-helix motif. The C terminus of the ILD is connected to the 
transmembrane region, comprised of six o-helices (TM1-TM6), a 
pore (P)-helix and three lumenal loops. The C-terminal domain 
(CTD) contains an ~80A long o-helix (residues 2681-2731) con- 
nected at its N terminus to TM6 via a helical linker domain (LNK, 
residues 2601-2680) (Fig. 1b-d). 

Unique to InsP3R1, the entire tetrameric architecture of the chan- 
nel is built around two four-helix bundles: one transmembrane and 
one cytosolic bundle that together form a central core along the four- 
fold axis (Fig. 1d). The transmembrane bundle is right-handed and 
formed by the TM6 helices (one from each subunit), whereas the 
cytosolic bundle contains the CTD «-helix from each subunit packed 
in a left-handed fashion. The latter spans most of the cytosolic region 
and earlier was described as the ‘plug’ density”. The transmembrane 
and cytosolic helical bundles are connected via four LNK domains 


2750 


Lumenal 
loop 


in the text, HD is the «-helical domain. c, An individual subunit colour-coded 
by domain. d, Central core structure of tetrameric InsP3R1. Four-fold axis is 
indicated by the dashed line; arrows indicate the bundle handedness. 


(one from each subunit), each consisting of two short, nearly ortho- 
gonal helices. 


Ca?* permeation pathway of InsP3R1 

The cryo-EM densities comprising the transmembrane region of 
InsP3R1 are the most-resolved portions of the map (Extended Data 
Fig. 2c). As such, it was possible to construct a complete model for 
the transmembrane domains (TMDs), which constitute the ion- 
permeation pathway along the central four-fold axis (Fig. 2a, b). 
The Ca** conduction path is lined by four TM6 helices, tilted 
~37 © relative to the membrane normal and packed in a right-handed 
bundle (Figs 2a, b and 3), a conformation originally observed in the 
KcesA channel!'. The TM6 helices are ~55 A long, extending beyond 
the cytosolic membrane surface and curving radially to form a taper- 
ing path for ions (Fig. 3a). The TMS5 helices are packed against the 
TM6 helices (Fig. 2 and Supplementary Video 2). The lumenal loop 
between the TM5 and TM6 helices contains a short P-helix (residues 
2531-2544) and a selectivity filter comprising highly conserved resi- 
dues 2546-2552 (Extended Data Figs 7a and 8c)’. The central bundle 
of TM5 and TM6 helices is linked to the flanking TM1-TM4 bundle 
via an amphipathic o-helix (TM4-5), which lies parallel to the cyto- 
solic membrane leaflet (Fig. 2). Mutations in the TM4-5 helix have 
been shown to affect the channel gating suggesting that this helix may 
influence the motions of the pore-lining helices of InsP3R (ref. 12). As 
observed in the other known tetrameric cation channels, TM4 cradles 
TMS of the adjacent subunit (Fig. 2a). 

The structure of the pore is wider at the lumenal face of the mem- 
brane, where the P-helices and selectivity filter are located (Fig. 3). 
His2541 is present at the C-terminal end of the P-helix and forms a 
ring of positive charges on the lumenal side, possibly repelling Ca** in 
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Figure 2 | Structure of the transmembrane domains. a, Arrangement of 
transmembrane helices in tetrameric InsP3R1; subunits are colour-coded 

and viewed from cytosol (left) and lumen (right). b, The structures of two 
opposing TMDs are superimposed on cryo-EM densities; view is along the 
membrane plane. One subunit is colour-coded by domain. ¢, Structural compa- 
rison of TMDs from InsP3R1 (left) and RyR1 (PDB accession 3J8H, middle); 
structurally conserved domains are shown overlapped (right); a root mean 
squared deviation (r.m.s.d.) value between the TMDs is 1.135 A for 77 atom pairs. 


the non-conducting channel. In order for Ca’* to pass beyond the 
lumenal vestibule, we propose that the P-helices may undergo a struc- 
tural rearrangement when the channel opens into a conductive state. 


Figure 3 | Detailed structure of the Ca?* conduction pathway. a, A bundle 
of TM6 helices shape the ion permeation pathway. A series of hydrophobic 
residues within a constriction region of the channel pore are labelled. The wire 
cage is the void-density along the ion conduction pathway; colour-coded by 
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Our structure suggests that the physical gate for ion-permeation is 
located at the point of constriction along the TM6 helices (Fig. 3 and 
Extended Data Fig. 6c). This constriction, located closer to the cyto- 
solic side of the membrane, includes a series of hydrophobic residues, 
Leu2582, Phe2586 and Ile2590, facing the Ca”* permeation pathway. 
At Phe2586, densities for the side chains are clearly visible and point 
towards the central axis, where they shape a pore of ~5 A in diameter, 
suggesting a non-conducting channel conformation given an effective 
diameter of the hydrated Ca?* (8-10 A)", Noteworthy, in this con- 
figuration the pore will also not be permissive for hydrated K* ions 
(6 A)'*. Equivalent hydrophobic constrictions have been identified in 
the structures of other channels'*-’’ (Supplementary Discussion). Just 
above the gate, a cytosolic vestibule formed by four TM6 helices has 
negatively charged residues that may facilitate Ca”* translocation into 
the cytosol (Extended Data Fig. 7a). Altogether, our data support the 
idea of structural conservation of tetrameric cation channel design 
(Supplementary Discussion). 


Cytosolic scaffolding architecture 


Perhaps the most prominent feature of the cytosolic region of 
InsP3RI1 is the solenoid-like architecture of three «-helical domains 
(ARM1-ARM3) formed by ensembles of armadillo repeats (Fig. 1 and 
Extended Data Fig. 4c, d). The modular architecture of the ARM 
domains in InsP3R1 is particularly amenable for generating different 
interfaces for recognition and binding of a large set of modulatory 
proteins whose putative binding sites have been previously iden- 
tified”? (Extended Data Fig. 4a). Thus, the flexible architecture of 
the ARM domains probably facilitates propagation of ligand-evoked 
signals towards the ion-conduction pathway. 

The ARM domains constitute peripheral densities of the cytosolic 
region that form a concave surface around the central helical bundle 
(Fig. 1c). ARM1 (residues 436-714) is formed by the stacking of five 
armadillo repeats (Extended Data Fig. 5). The first two armadillo 
repeats of the ARM1 and preceding two B-TF domains (Fig. 4a-c) 
constitute the ligand-binding domain (LBD)*’. The B-TF1 domain 
(residues 5-225) was termed the InsP3-binding suppressor domain, 
and B-TF2 (residues 226-435) with two a-helices (residues 436-604) 
of ARM1 constitutes the InsP3-binding core (IBC) region, also desig- 
nated as B-IBC and «-IBC subdomains, respectively (Fig. 4a—-c and 
Extended Data Fig. 5). The suppressor domain, B-IBC and «-IBC of 
each subunit form a triangular structure similar to previously 
reported crystal structures®’ (Extended Data Fig. 4b). The four 
LBDs constitute a cytoplasmic apical density around the CTD bundle 
(Fig. 4a). The ARM2 (residues 1030-1494) and ARM3 domains (resi- 
dues 1594-2192) include ten and six armadillo repeats, respectively, 
and together with the helical domain (residues 715-1029), connect 
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hydrophobicity (right). b, The lumenal vestibule lined by P-helices and 
selectivity filter (SF) loops viewed along the four-fold axis from the lumen; 
insert shows the close-up view. 
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Figure 4 | Structural coupling between the CTD and InsP3-binding 
domains. a, Cryo-EM density map of InsP,R1 viewed from the cytosol with 
LBDs and CTDs colour-coded by subunit. b, InsP3R1 model corresponding to the 
region marked in a; blue stars denote InsP3-binding sites; purple spheres denote 
Tyr167. c, Contacts between LBD and CTD of adjacent subunits. Domains are 
denoted as ellipses colour-coded as in Fig. 1b. Residues involved in InsP, 
binding*” are rendered as blue spheres. d, The interface between B-TF2 and 
B-TF1 of neighbouring subunits viewed orthogonally with respect to c. Residues 
on B-TF2 domain within 5 A from hotspot loop of B-TF1 are coloured green. 


the InsP3-binding domains to the channel-forming region. It is 
notable, that density in ARM2 is less resolved, suggesting higher 
structural flexibility (Extended Data Fig. 2c). The entire cytosolic 
solenoid scaffold in the tetrameric InsP3R1 communicates with the 
TMDs via the central helical bundle and the ILD (Fig. 1b, c and 
Supplementary Video 2). 


Gating via C- and N-terminal coupling 

A functional hallmark of InsP3R channels is the complex interplay 
between the binding of primary ligands (InsP3 and Ca**) and channel 
gating. In addition, several intracellular regulatory molecules interact 
with InsP3R in a dynamic manner providing functional response in 
the channel”® (Extended Data Fig. 4a). Our structure suggests that 
allosteric modulation of InsP3R gating by intracellular signals can 
occur via several different initiation points that mechanically couple 
these signals to a common ion-conduction pathway conserved among 
a large group of ion channels (Extended Data Fig. 7b). 

Given the spatial separation between the InsP3-binding and ion- 
conduction domains within the tetrameric InsP3;R1 (Fig. lc and 
Extended Data Fig. 8a), there has to be a coupling mechanism for 
transmission of ligand-evoked signals to the pore to produce a specific 
gating event. In the crystal structures of the LBD, solved in both the 
InsP3-bound and apo-states®’, InsP; binding at the cleft between the 
B-IBC and a-IBC domains (Fig. 4b, c) causes closure of the ‘clam-like’ 
InsP3-binding pocket and movement of the suppressor domain. The 
intra-subunit interface between the B-TF1 and B-TF2 domains was 
shown to be dynamic, allowing for B-TF1 to twist in response to InsP 
binding to the IBC®’. While precise conformational changes assoc- 
iated with InsP;-induced activation of the channel gating in the native 
tetrameric InsP3R1 are not yet known, these signals may propagate 
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Figure 5 | Cytosolic intra- and inter-subunit interactions. a, Inter-subunit 
contacts of B-TF1 with ARM2 and ARM3 domains; subunits are colour-coded. 
b, Two opposing subunits are viewed along the membrane plane; domains 
of one subunit are colour-coded. Transmembrane-cytosolic domain interface 
is indicated with a red dashed line (left), and its close-up view is shown in insert 
(middle right). The structure of LNK domain is superimposed on cryo-EM 
densities (bottom right); residues of the C2H2 Zn?" -finger-like motif are 
coloured blue. 


towards the TMDs via discrete intra- and inter-subunit interfaces 
identified in our structure (Figs 4 and 5). Our structure shows that 
the helix-turn-helix (%2-turn-«3) motif, probably communicates 
with the «72 and 076 helices of the ARM3 domain within the adjacent 
subunit (Fig. 5a and Extended Data Fig. 5). Because the C terminus of 
the ARM3 domain is connected to the TMD via the ILD, it is con- 
ceivable that interactions between the B-TF1 and ARM3 domains will 
propagate InsP; evoked conformational changes in the LBD to the 
pore (Fig. 6a). Importantly, suppressor domain deletion mutants of 
InsP3R1 have been shown not to exhibit any measurable InsP3- 
induced Ca" release”!, emphasizing its critical role in transmitting 
the InsP; binding signal. Furthermore, the ARM3 domain contains a 
putative Ca?*-sensor region (residues 1933-2271), where a highly 
conserved Glu2101 has been implicated in mediating channel 
gating*’’, However, no EF-hand Ca’* binding motif has been found 
in InsP3R1 (Extended Data Fig. 9). 

The B-TF1 domain also forms inter-subunit interactions with B-TF2 
and ARM2 domains. The loop of B-TF1 (residues 166-180) is posi- 
tioned to interact with non-contiguous regions of B-TF2 of the neigh- 
bouring subunit including residues 246-248, 373-376, 387-389 and 
426-429 (Fig. 4d). Mutation of Tyr167 was shown to impair InsP3- 
induced Ca*™ release but not affect InsP; binding to InsP;R1 (ref. 24), 
suggesting the importance of this inter-subunit interface in signal trans- 
mission. Furthermore, the B-TF1 loop (residues 136-139) makes addi- 
tional inter-subunit contacts with «44 and «47 helices of the ARM2 
domain (Fig. 5a), which specifies modulatory activities for InsP3- 
mediated Ca?~ release”® (Extended Data Fig. 4a). All together, this 
alludes to the importance of the B-TF1 domain in coupling InsP; bind- 
ing to activation of the channel gate. However, our structure shows no 
direct coupling between the LBD and TM4-5 as previously proposed”®, 
suggesting a more long-range mechanism for signal transmission. 
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Figure 6 | Model for coupling mechanism of InsP3R1 activation by InsP3. 
a, Schematic representation of inter-subunit contacts involved in propagation 
of InsP;-binding signal to the pore (colour-coded by subunit): from B-IBC 

to CTD/LNK domains and from the suppressor domain (SD) to ARM3/ILD 
domains of neighbouring subunits. b, The InsP3-induced changes in the LBDs 
can cause the helices in the cytosolic bundle to rearrange and trigger the 
motions of the LNKs, that may force the transmembrane bundle to adopt a 
conformation permeable for Ca?*; colour-coded by subunit. 


The CTD in InsP3R1 is comprised of a coiled-coil motif, forming a 
long helical bundle that spans almost the entire cytosolic assembly of 
the channel, unlike the CTD in ryanodine receptor 1 (RyR1)””” 
(Figs 1c, d, 2c and Extended Data Fig. 8a). The coiled-coil structure 
is potentially stabilized by a predicted saltbridge between adjacent 
subunits at residues Gln2700 and Lys2701 (Extended Data Fig. 10a). 
Furthermore, the CTD of one subunit interacts with the B-TF2 of the 
adjacent subunit by predominantly inter-domain electrostatic interac- 
tions (Fig. 4 and Extended Data Fig. 10b, c). This observation is con- 
sistent with earlier biochemical studies of InsP3R proposing that the 
N-terminal and C-terminal regions are probably in a close association 
in the native channel***'. In this arrangement, the cytosolic helical 
bundle is in a prime position to sense a signal resulting from InsP, 
binding and may undergo a conformational change playing a critical 
role in transmitting the InsP; signal to the TMDs (Figs 4 and 6). 

The CTDs are connected to LNK domains that form a connecting 
ring between the cytosolic and transmembrane bundles and essentially 
establish continuous communication between the N-terminal LBDs 
and the pore-lining TM6 helices (Figs 1c, d and 5b). Noteworthy, the 
LNK domain of InsP3R1 is structurally analogous to the CTD domain 
(residues 4957-5037) in RyRI1 (refs 21, 32). Both domains contain a 
Cys2His2 (C2H2) Zn** -finger motif, although in InsP3R1 there are 16 
residues between Cys2614 and His2631 rather than the typical 12 
residues (Fig. 5b and Extended Data Fig. 7a). While modulation of 
the InsP;R channel function has not been investigated regarding its 
sensitivity to Zn’, this domain may have a role in coordinating metals 
that influence channel gating. Mutational studies of the C2H2 residues 
were shown to either inhibit (Cys2611Ser, Cys2614Ser, His2636Ala) or 
completely abolish (His2631Ala) channel function, indicating that the 
LNK domain is positioned to be a crucial structural component for 
allosteric modulation of channel gating”. 

This region also contains the sequence (residues 2630-2655) pro- 
posed to be essential for InsP3R tetramerization” (Extended Data Fig. 
7a). A predicted hydrogen bond between Asn2241 and Glu2617 in 
our structure may bridge the LNK and ILD of neighbouring subunits, 
possibly having a modest role in channel formation. However, the 
larger role of the LNK is not entirely evident from our structure given 
the lack of direct inter-subunit contacts of the LNK domains in the 
tetrameric InsP3R1 assembly. Rather, the combined interactions 
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throughout the entire central core structure of InsP3R1 may serve 
to stabilize the tetrameric interfaces providing structural and func- 
tional integrity of InsP3R channel (Fig. 6b). 

The dominant role of the CTD in transmitting the conformational 
changes triggered by InsP binding to channel gate opening is further 
supported by an earlier study demonstrating that deletion of 43 resi- 
dues from the CTD disrupts channel gating’®. Furthermore, in a chi- 
maeric channel, comprising the TMD of RyR1 and the cytosolic 
domains of InsP;R1, the InsP; efficacy is substantially decreased while 
InsP, binding is minimally affected’. It is conceivable that the lack of 
the InsP;R1 CTD bundle in the chimaeric channel causes uncoupling 
of InsP3-evoked conformational changes to the channel gating. The 
unique CTD architecture of InsP3R1 suggests that the two families of 
Ca*” release channels explore different mechanisms for transmitting 
ligand-evoked signals to the ion-permeation pore (Supplementary 
Discussion). Our hypothetical model for transmission of the InsP3- 
evoked signal is schematically shown in Fig. 6. Again, while the precise 
InsP3-induced conformational changes remain to be revealed, the 
channel activation relies on concerted inter- and intra-domain inter- 
actions mediated in the context of the tetrameric InsP3R1 assembly. In 
this scenario, binding of a single InsP; molecule to one subunit 
can trigger a cascade of conformational changes in domains of two 
neighbouring subunits to propagate the activating signal to the ion- 
conducting pore. 


Conclusion 


Here we present the first, to our knowledge, near-atomic resolution 
structure of a eukaryotic intracellular InsP3-gated Ca*™ release chan- 
nel purified from rat cerebellum. Our structural analysis reveals con- 
servation of the ion-conduction pore across the tetrameric cation 
channel family, further suggesting a conserved mechanism for trans- 
location of ions in these channels. However, the control of channel 
gating in a cellular context is conferred by additional domains that are 
attached to the pore and contain specific functions, such as voltage- 
sensing or ligand-binding. The distinctive molecular architecture of 
these domains defines the nature of work that needs to be performed 
to convert cellular signals into mechanical gating motion. Thus, while 
InsP3R1 shares a set of common features with the closely related RyR1 
channel, the unique architecture in its C-terminal domain suggests a 
distinctive allosteric mechanism underlying activation of InsP3R gat- 
ing (Fig. 6). This mechanism requires direct mechanical coupling 
between the C terminus and InsP3-binding domains of adjacent sub- 
units and involves rearrangements of the inter-domain interfaces 
identified in this study. To our knowledge, this is the first structural 
evidence for the recruitment of the C terminus in gating of the InsP3R 
channel. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Statistical methods for predetermining sample sizes are not appropriate for cryo- 
EM analysis, which typically involves 10*-10° images. Instead, a range of more 
sophisticated validations and error estimates were performed as described below. 
Cryo-EM data acquisition. Detergent-solubilized InsP3R1 was purified from rat 
cerebellum, and its structure-function integrity has been confirmed as described 
in our earlier study’. The absence of any channel-specific modulatory proteins 
has been confirmed using immunoprecipitation/mass spectrometry (data not 
shown)™. The vitrification of the purified protein was performed as previously 
described’. Images of frozen-hydrated InsP3R1 particles were acquired on a 
Technai G2 Polara electron microscope (FEI) operated at 300kV using a K2 
Summit direct electron detector camera (Gatan). Images were collected in dose 
fractionation super-resolution counting mode at a nominal magnification of 
23,000, corresponding to a calibrated physical pixel size of 1.62 A and super- 
resolution pixel size of 0.81 A. The dose rate on the camera was set to ~10 elec- 
trons pixel’ s~'. The total exposure time was 6s, leading to a total accumulated 
dose of 22 electrons A~* on the specimen. Each image stack was fractionated into 
30 subframes, each with an accumulation time of 0.2 per frame. Images were 
acquired at the defocus range of —0.6 to —3.5 um. 

Image processing and 3D reconstruction. Dose-fractionated super-resolution 
raw image stacks of ice-embedded InsP3;R1 were binned 2 X 2 by Fourier crop- 
ping resulting in a pixel size of 1.62 A for further image processing. Each image 
stack was subjected to motion correction using ‘dosefgpu_driftcorr’®, and a sum 
of subframes 1-29 in each image stack was used for further image processing 
(Extended Data Fig. 1). We used ‘e2evalimage.py’ in EMAN 2.1 package to select 
3,743 micrographs from a total of 4,160 micrographs for subsequent processing. 
The signal in the motion-corrected images extends beyond ~4A (Extended 
Data Fig. 1b). 

156,805 particles were boxed out manually using ‘e2boxer.py’. Image proces- 
sing was then performed independently in both RELION1.3 and EMAN 2.1 
beginning with the same set of boxed out particles. For the RELION1.3 recon- 
struction, defocus and astigmatism were determined for each micrograph by 
CTFFIND3 (ref. 36). Our previously published map (EMDB accession 5278) 
was low pass filtered to 60 A resolution and used as a starting model for the 
RELION1.3 refinement. The first refinement yielded subnanometer resolution. 
After this step, we ran several rounds of iterative 3D classification and 3D auto- 
refinement to extract the most self-consistent subset of the particle data set. The 
final map was generated from 96,106 particles. 

The EMAN2.1 reconstruction followed standard refinement procedures”*” 
with an initial model generated from the current data set with no reference to 
the prior published structure. On the basis of per-particle SSNR estimates, the 
best 105,000 particles were included as input to the refinement. During refine- 
ment, the worst ~30% of these particles were discarded from each reference- 
based class-average, based on mutual similarity among particles. Refinement used 
the EMAN2.1 ‘e2refine_easy’ script, in which most refinement parameters were 
automatically selected. An initial low resolution refinement was performed using 
a smaller subset of the data to improve the initial model. As a result of image 
quality improvement, 2D class-average images generated through iterative image 
processing exhibit significantly more details than in previous studies* (Extended 
Data Fig. 1c). 

The particle orientation distribution is not entirely uniform with a broad 
distribution focused on a ~45° wide distribution near the equator (side views) 
and another grouping near the top view, along the axis of symmetry (Extended 
Data Fig. 2d). Given the C4 group symmetry, this distribution includes consid- 
erable numbers of particles extending from the equator to the pole, which is 
sufficient for a complete reconstruction with no missing information. 

The final RELION1.3 and EMAN2.1 refinements both used gold standard 
procedures***’. In this method, the particle data were split into even/odd num- 
bered halves, and two maps were refined completely independently. The Fourier 
shell correlation (FSC) 0.143 cut-off was used to estimate the resolution of final 
3D reconstructions with a soft auto-mask in RELION post-processing (Extended 
Data Fig. 2a). RELION1.3 density map was sharpened by applying a B-factor of 
—256 A° that was estimated using an automated procedure® and visualized 
with Chimera“. EMAN2.1 maps were automatically filtered as part of the refine- 
ment. Local resolution variations were estimated using ResMap* (Extended 
Data Fig. 2c). 

The resulting 3D reconstructions from the two software packages, RELION1.3 
and EMAN2.1, were assessed by computing an FSC between these two maps. 
Since both maps used the full data set, it is more appropriate to use a 0.25 FSC 
threshold in this comparison, which corresponds to 2X the SSNR of the 0.143 
threshold (Extended Data Fig. 2b). Note, that regardless of localized differences at 
the highest resolutions, the topology of the two maps is consistent, and the same 


protein model can be optimized for both maps. All structure/function interpreta- 
tions would be identical for both maps, irrespective of the resolution value 
ascribed to the maps. 

Model building. Before building a model for the InsP;R1 subunit, secondary 
structure identification was performed with SSEHunter in Gorgon**. A total of 
106 helices were identified and a series of 3-sheets in the central apical domains of 
cytosolic region were identified per InsP3;R1 subunit. Sequence analysis was then 
performed on the InsP3R1 primary sequence (GI accession 17380349). As the 
entire protein was generally too large for most web-based services, serial over- 
lapping sequence segments were used to initially screen the sequence; sequence 
segments were ~800 amino acids in length and overlapped by ~200 amino acids 
in either direction. Secondary structure prediction was performed using 
JPRED3*. Analysis of the secondary structure illustrated a predominance of 
helices, except over the first ~500 amino acids which were indicated to be mainly 
B-sheet. Deriving the molecular structure of InsP;R1 used a three-pronged 
approach to generate initial models for different parts of the protein sequence 
depending on the availability of available crystal structures, fold recognition using 
homologous structures and completely de novo modelling (Extended Data 
Fig. 4a). Fold recognition was first performed on the InsP3R1 sequence using 
PHYRE2 (ref. 46). As expected, the known crystal structures of the N-terminal 
domains of InsP3R (PDB accessions 3T8S and 3UJ4) were identified spanning 
residues 7-580. In addition, the structure of the RyR1 N-terminal domain was 
identified (PDB accession 2XOA). Two structural homologues over residues 
1104-1431 were identified; the ARM repeat from 1N4K and an o-« superhe- 
lix/ARM repeat (PDB accsesion 1XQR) with 99.7% (25% identity) and 42% (12% 
sequence identity) confidence, respectively. Models for these domains were 
obtained directly from the PHYRE2 website (intensive option) and fit to the 
InsP3R1 density map using UCSF Chimera’s ‘fit in map’ module, EMAN’s 
Foldhunter and Gorgon”. The N-terminal model was localized to LBD and 
ARM1 regions of the density map. The composite model for residues 1104-1431, 
containing an o&-o superhelix motif, was localized in the ARM2 region of the 
density map. The secondary structure elements in the models matched well with 
those identified in the cryo-EM density map, further validating the fit of these 
structures. Together, these two models accounted for ~33% of the total amino 
acids in the InsP3R1 subunit. A third homologous domain, the crystal structure of 
a voltage-gated Na* channel (PDB accession: 4DXW), was found using PHYRE 
corresponding to the TMD residues (residues 2278-2605) with 98.7% confidence 
and ~ 19% sequence identity. While the structural homologue was fit to the TMD 
of our InsP3R1 cryo-EM density map, it was not directly used for modelling. The 
voltage-gated Na” channel structure was used to confirm the putative InsP3R1 
subunit boundaries in the TMD. Modelling of the TMD and the remaining 
residues were accomplished de novo as described below. 

The remainder of the sequence for InsP3R1 was modelled using our previously 
developed de novo modelling protocol**“’ (Extended Data Fig. 4a). Essentially, we 
used the aforementioned structural homologues to anchor our sequence assign- 
ments. Extending from the ends of the models, we identified the next directly 
connected secondary structure element via density and/or density skeleton. We 
then attempted to match the observed SSEHunter secondary structure elements 
to what was predicted in the primary sequence; the fitted models provided a rough 
registration of the sequence in the density map, making this process feasible. A Cat 
backbone model was then constructed for this region using either Gorgon” or 
Coot*. The modelling procedure was repeated extending from the previously 
placed secondary structure elements until either the models merged or encoun- 
tered a break in the density. It should be noted that model construction was done 
with the aforementioned 4.7A resolution RELION1.3 map, preserving the 
EMAN2.1 map to act as a control. The Ca, backbone model was subsequently 
verified using our Pathwalking protocol”, which helps to identify alternate back- 
bone topologies. Through this process, less than 15% of the residues were not 
modelled, likely due to the poor resolvability of the map in those regions 
(Extended Data Fig. 4a; see main text). 

Once a complete Ca backbone model was constructed, the model was trans- 
formed into a full atom model using REMO* and manually adjusted to best fit the 
density in Coot’. As the density in the transmembrane region was best resolved, 
model optimization to the density began in the transmembrane regions followed 
by the remaining regions. From this initial Coot-optimized model, the full-atom 
model for the tetrameric assembly was then refined with Phenix’s real-space 
refinement tools as previously described™. In brief, we first used the default 
real-space refinement settings, followed by subsequent rounds of refinement 
(~10) incorporating simulated annealing, global minimization, rigid body min- 
imization and local grid searching. After Phenix refinement, the model was again 
manually adjusted in Coot to best fit the density. This process was iterated over 
10 rounds of map-model refinement with Phenix and Coot. As a final test of 
model fidelity and map agreement, the final model refined against the 
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RELION1.3 map was then in turn refined against the EMAN2.1 map. This result- 
ing refinement gave a nearly identical model with an r.m.s.d. of only 0.8 A. Inter- 
and intra-subunit interfaces described here were identified using PDBSum” and 
PDBE PISA*. 

It should be noted that the de novo modelling process relies on the ability to 
register sequence predicted helices with those found in the structure. This regis- 
tration is further enforced by anchoring large, bulky side chains to protruding 
densities in helices. However, potential errors in the map, modelling and refine- 
ment procedures may result in register shifts, non-optimal side-chain placement 
and tracing errors where there are not sufficient density anchors or where the map 
is not as well resolved”. 
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Extended Data Figure 1 | Cryo-EM of the purified InsP3R1. a, Representa- _ extending to 3.8 A.c,2D projections of the InsP3R1 map from the last iteration 
tive electron image of ice-embedded InsP3R1 taken at a defocus of 2.1jmand of refinement by RELION1.3 (rows 1 and 4) and EMAN2.1 (rows 2 and 5) are 
recorded using the K2 Summit camera. Shown image is motion-corrected. shown with corresponding 2D class averages (rows 3 and 6). 

b, Fourier power spectrum of the micrograph shown in a with Thon rings 
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Extended Data Figure 2 | Resolution estimation of cryo-EM 3D 
reconstruction. a, The gold-standard FSC curve for the final cryo-EM 3D 
reconstruction generated with RELION1.3 (black) and EMAN2.1 (green). 
The overall resolution is 4.7 A using the FSC cut-off = 0.143. b, Shown is the 
FSC curve between the cryo-EM density map generated with RELION1.3 
and the molecular model (blue, FSC cut-off is 0.5) and the FSC curve between 
the 3D reconstructions generated with EMAN2.1 and RELION1.3 (red, FSC 
cut-off is 0.25). c, The cryo-EM density map of InsP;R1 generated with 
RELION1.3 is colour-coded based on ResMap (Methods). The map is depicted 


as viewed along the membrane plane (left) or as a slab of the density (dashed 
box in left panel) coincident with the four-fold channel axis (right). The cryo- 
EM density map exhibits local resolution variation ranging from 3.6 A to 6.5A 
with the most highly resolved densities in the TMD, while parts of the 
peripheral densities in the cytoplasmic region are of lower resolution. The low 
resolution peripheral density in the transmembrane region is attributed to 
detergent bound to the protein. d, Euler angle distribution of all particles used 
for the final 3D reconstruction. Each view is represented by a sphere, for which 
the size is proportional to the number of particles for this specific view. 
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View from cytosol 


Extended Data Figure 3 | 3D cryo-EM density map of the tetrameric (bottom). Four individual subunits are colour-coded. The map is rendered at a 
InsP;R1 visualized in three orthogonal orientations. InsP;R1 viewed from __ threshold level corresponding to a molecular mass of ~1.3 MDa. 
the cytosol (top), along the membrane plane (middle) and from the lumen 
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Extended Data Figure 4 | Building an atomic model of InsP3R1. a, Linear 
representation of a primary structure of InsP3R1 protein (GI accession 
17380349). Ten domains identified in the cryo-EM density map are colour- 
coded. Three sites of alternative splicing (residues 318-332/SI, 918-926/SIII 
and 1692-1731/SII) are indicated below the sequence bar. Putative binding 
sites for several channel-specific ligands are indicated above the domains 
(ATP, ATP-binding CaM/CaBP, calmodulin/Ca?t binding protein; CGA, 
chromogranin A; cyt c, cytochrome c; Htt“?, huntingtin; PKA, protein kinase 
A; RIH, RyR/InsP3R homology; yellow circles denote Ca”* -binding”). The 
panel below shows a linear diagram of the protein sequence colour-coded based 
on the approach used for modelling different regions in the primary structure. 
The spaces between the bars correspond to unmodelled sequence (see also 


Extended Data Fig. 5). b, The structure of the N-terminal domain (NTD) based 
on cryo-EM density map (red) is shown overlapped with the X-ray crystal 
structures of NTD (r.m.s.d. values = 1.3-1.4 A): 1XZZ (blue); 3T8Sa (cyan); 
3T8Sb (green); 3UJ4 (yellow); and 3UJO (magenta). c, Cryo-EM density 
map is viewed along the membrane plane. Densities corresponding to the 
individual domains of one subunit of InsP3R1 are colour-coded as in 

a. d, Models for solenoid-like «-helical domains are shown superimposed on 
their corresponding cryo-EM densities. ARM1-ARM3, armadillo repeat 
domains1-3; HD, o-helical domain. The densities of ARM2 domain are less 
resolved than those of ARM1 and ARM3 but are sufficient to trace the 
backbone. 


©2015 Macmillan Publishers Limited. All rights reserved 


rIp3Ri 
hIP3R1 
hIP3R2 
hIP3R3 


rIP3R1 
hIP3R1 
hIP3R2 
hIP3R3 


rIP3R1 
hIP3R1 
hIP3R2 
hIP3R3 


rIP3Rl 
BIP3R1 
hIP3R2 
hIP3R3 


rIP3Ri 
hIP3R1 
hIP3R2 
hIP3R3 


rIP3R1 
bIP3R1 
bIP3R2 
hIP3R3 


rIP3Ri 
hIP3R1 
hIP3IR2 
BIP3R3 


rIp3Ri 
hIP3R1 
hIP3R2 
hIP3R3 


rIP3Ri 
hIP3R1 
hIP3R2 
hIP3R3 


rIP3R1 
hIP3R1 
hIP3R2 
hIP3R3 


rIP3Ri 
hIP3R1 
hIP3R2 
hIP3R3 


rIP3R1 
hIP3R1 
hIP3R2 
hIP3R3 


rIP3R1 
hIP3R1 
hIP3R2 
hIP3R3 


rIP3Ri 
hIP3R1 
hIP3R2 
hiIp3R3 


rIP3R1 
hIP3R1 
hIP3R2 
RIP3R3 


rip3Ri 
bIP3R1 
hIP3R2 
hIP3R3 


rIP3Ri 
hIP3R1 
hIP3R2 
hIP3R3 


rIP3R1 
hIP3R1 
hIP3R2 
hIP3R3 


ARTICLE 


MSDKMSSFLEIGDICSLYAEGSTNGF ISTLGLVDDRCVVQPEAGDLENPPKKFRDCLFKLCPMNRY SAQKQFWKAAK PGANS ~~ TTDAVLLNKLEHAADLEKKQNETENRKLLGTVIQYGNVIQLLELKSNKYLTV 
MSDKMSSFLHIGDICSLYAEGSTNGF ISTLGLVDDRCVVQPETGDLNNPPKKFRDCLFKLCPMNRY SAQKQFWKAAKPGANS ~--TTDAVLLNKLHHAADLEKKQNETENRKLLGTVIQYGNVIQLLHLKSNKYLTVN 135 
MTEKMSSFLY IGDIVSLYAEGSVNGF ISTLGLVDDRCVVHEPEAGDLANPPKKFRDCLFKVCPMNRY SAQKOY WKAKQAKQGN ~~ HTEAALLKKLOQHAAELEQKQNESENKKLLGEIVKYSNVIQLLEIKSNKYLTVN 135 
Se ee eee rey Cee AR eee DEELEY RECUR RCE MENS SARE TENEAED TE DRE EEL ADU SL LANL DRA R SUEDE DUST ENE N Eee Cente eer aege 136 
Lense eee 10......-- 20.....+.+.- 30...222- 40.......- 5O.....+-- 1) Pee eee 80.......- 90......-.100....... 110... cece 120......+130..... 


7 9 10 as 1 12 13 aS 


KRLPALLEKNAMRVTLDEAGNEGSWFY IQPFYKLRSIGDSVVIGDKVVLNPVNAGQPLHASSHQLVDNPGCNEVNSVNCNTSWKIVLFMKWSDNKDDILKGGD VVRLFHAEQEKFLTCDEHRKKQHVFLRITGROQSA 272 
KRLPALLEKNAMRVTLDEAGNEGSWFY IQPFYKLRSIGDSVVIGDKVVLNPVNAGOPLHASSHOLVDNPGCNEVNSVNCNTSWKIVLFMKWSDNKDDILKGGD VVRLFHAEQEKFLTCDEERKKQHVFLRITGROSA 272 


KRLPALLEKNAMRVSLDAAGNEGSWPFY IEPFWKLRSEGDN I VVGDKVVLMPVNAGOPLHASN I ELLDNPGCKEVNAVNCNTSWKITLFMKYSSYREDVLKGGD VVRLFHAEQEKFLTCDEYEKKQHIFLRITLROSA 272 
KRLPALLEKNAMRVTLDATGNEGSWLF I QPFWKLRSNGDNVVVGDKVILNPVNAGQPLHASNYELSDNAGCKEVNSVNCNTSWKINLFMQFRDE 'LKGGD VVRLFHAEQEKFLTCDEYKGKLQVFLRITLRQSA 273 
+ -140....... 150. ..cc06 160....... 170......- 180....... 190....... 200....... 210.......- 220....... 230.......- 240. ....+250.......- 260.......270.. 

14 15, 16 1 as a7 a8 


QSATSATSSKALWEVEVVQEDPCRGGAGIWNSLFRFKELATGHYLAAEVDPDFEEECLEF OPS VDPDQDASRSRLRNAQEKMVYSLVSVPEGNDISSIFELDPTTLRGGDSLVPRNSYVRLRELCTNTWVHSTNIPI 
QSATSATSSKALWEVEVVQEDPCRGGAGIWNSLFRFKELATGHYLAAEVDPDFEEECLEFOPSVDPDQODASRSRLRNAQEKMVY SLVSVPEGNDISSIFELDPTTLRGGDSLVP. 


QSATSATSSKALWE I EVVEEDPCRGGAGQWNSLFRFKELATGNYLAAELNPDY RDAQNEG' RDGVPPTSK-KKRQAGEKIMYTLVS IASLFELDATTLQRADCLVP: 
ALWEVEVVEEDPCRGGAGEWNGLYRFKELATGNYLAAEENPSYKGDASD KAAGMGAQGRTGRRNAGEKIKYCLVA |GNDIASLFELDPTTLOKTDSFV YVRLRELCTNTWIQSTNVPI 
BLO 6 oe e320. oe TID ween B40 nooo e350. eee F6Oe oe FTO 6 oo ee FBO 6 oe 39D. ee - 400.~.-6» 


all al2 at3 aig ai5 


DKEEEK PVMLKIGTSPLKEDKEAFAIVPVSPAEVRDLDFANDASKVLGS IAGKLEKGT I TONERRSVTKLLE DLVYFVTGGTNSGQDVLEVVFSKPNRERQKLMREQNILKQIFKLLQAPFTD-CGDGPMLRLEELG 542 
DKEEEK PVMLKIGTSPVKEDKEAFAIVPVSPAEVRDLDFANDASKVLGS IAGKLEKGT I TQONERRSVTKLLE DLVYFVTGGTNSGQDVLEVVFSKPNRERQKLMREQNILKQIFKLLQAPFTD-CGDGPMLRLEELG 542 
DIDEER PVMLKIGTCQTKEDKEAFAIVSVPLSEVRDLDFANDANKVLATTVKKLENGT I TQNERRFVTKLLE DLI FFP VADVPNNGQEVLDVVITKPNRERQKLMREQNILAQVFGILKAPFKEKAGEGSMLRLEDLG 542 
Seema Cee eh eee Ce ee eee DLVFFVSDVPNNGQNVLDIMVTKPNRERQKLMREQNILKQOVFGILKAPFREKGGEGPLVRLEELS 542 
~4210....... 420....... 430.....-. 440.......450......- +++520...---- 530.......540... 


ai5 ral al7 ral at « 


DQRBAPFREICRLCYRVLRESQODYRENQEY IAKOPGFMOKOIGYDVLAEDTI TALLENNRKLLEKHITAAEIDTFVSLVRKENREPRFLDYLSDLCVSMNKSI PVTQELICKAV! NADILIETKLVL: 
DQRHAPFREICRLCYRVLRESQQDYRENQEY IAKQFGFMQKQIGYDVLAEDTI TALLENNRKLLEKHITAAEIDTFVSLVRKNREPRFLDYLSDLCVSMNKSIPVTQELICKAVLNPTNADILIETKLVLSRFEFEG 679 
DQRYAPYKYMLRLCYRVLRESQQDIRKNQEY IAKNFCVMQSQIGYDILAEDTITALLENNRKLLEKHITAKEIETPVSLLRRNREPRFLDYLSDLCVSNTTAIPVTOELICKPMLSPGNADILIOTKVVSMOADNPM 679 


DQOKNAPYQEMPRLCYRVLRESQEDYRENQER IAKOFPGMMOSOIGYDILAEDTI TALLENNRKLLEKHITKTEVETFVSLVRKENREPRFLDYLSDLCVSNEIAIPVTQELICKCVLDPKNSDILIRTELRPVKEMAQS 679 
pone repre Rec rece Berner BeOoe tern BeOcaraeas then -~+-610.......620. «= -630.......640.. --650.......660.. = +670.....-.4 680 
a29 a0 


VS-TGENALEAGEDEEEVWLFWRDSNKE I RSKSVRELAQDAKEGQKEDRDVLSY YRYOLNLFARMCLDROYLAINEISGQOLDVDLILRCMSDENLPYDLRASFCRLMLEMEVDRDPQEQVTPVKYARLWSEIPSEIA 
VSSTGENALEAGEDEEEVWLFWRD: EIRS LAQDAKEGQKEDRDVLSYYRYQLNLFARMCLDRQYLAINEISGQLDVDLILRCMSDENLPYDLRASFCRLMLEMEVDRDPQEQVTPVKYARLWSEIPSEIA 


Es SILSDDIDDEEVWLYWI DSNKEPHGKAIRELAQEAKEGTKADLEVLTYYRY OLNLFARMCLDROQYLAINQISTQLSVDLILRCVSDESLPFDLRASFCRLMLEMEVDRDPQESVVPVRYARLWTEIPTKIT 

a EYLSIEYSEEEVWLIWITDENNE KSVRQLAQEARAGNAEDENVLSYYRYQOLKLFARMCLDROYLAIDEISQOLGVDLIFLCMADEMLPFDLRASFCHLMLEVEVDRDPQELVTPVKFARLWTEIPTAIT 

RT (ye 5 etn Ty re 730-+-+--- rt eee eee 160-+++-++ TTO+++++e% 7BO-+++++ T90+++ +++ 800-+-+--- 810---+- 
«31 «32 «33 


IDDYDSSG-ASKDEIKERFAQTMEFPVEEYLROVVCORFPPPSOKEKNKLTFPEVVNLARNLIYPGPYNFSOLLRLTKILLAILDCVHEVITIFPPISKMTKGEENKG NVMRSIHGVGELMTOVVLRGGG 
IDDYDSSG-ASKDEIKERFAQTMEF VEEYLRDVVCORFPFSDKEKNKLTFEVVNLARNLIYFGPYNFSDLLRLTKILLAILDCVHVTTIFPISKMAKGEENKGNNDVEKLKSSNVMRSIHEGVGELMTQVVLRGGG 


IBEYDSITDSSRNDMKRKFALTMEF VEEYLKEVVNOQPF PFGDKEKNKLTFEVVELARNLI YF GFYSFSELLRLTRILLAILDIVQAPMSSYFERLSKFQDGG-- NNVMRTIHGVGEMMTQMVLSRGS-- 937 
IKDYDSNLNASRDDKKNKFANTMEF VEDYLNNVVSEAVPFANEEKNKLTFEVVSLAENLI YF GFYSFSELLRLTRILLGI IDCVQG-~--PPAMLOQAYEDPGG KNVRRSIQGVGHMMSTMVLSRKQSV 935 
820---- 830---- ae COs Resear ee ar Seri fete 6) arr pesbaee -940- ee eee sence. 

36 «37 
PLPM TPMAAAPEG EKEDIMVMDTKLKI IEILQFILNVRLDYRISCLLCIFKREFDE TSSGNSSQEGPSNVPGALDFEH I EEQAEGIFGGSEENTPLDLDDHGGRTFLRVLLELTMEDYPPL 1075 
PLPM TPMAAAPEGN-VKQAEP-EKEDIMVMDTKLKI IEILQF ILNVRLDYRISCLLCIFKREFD! IEEQAEGIFGGSEENTPLDLDDHGGRTFLRVLLELTMEDYPPL jogs 


IFPMSVPDV I KQG: EREDVTVMDTKLKIIEILQFILSVRLDYRISYMLS I YKKEPGEDNDNAETSASGSPDTLLPSAI --VPDIDEIAAQAETMFAGRKEKNPVOLDDEGGRTFLRVLIHLIMEDYPPL 1072 
PSAP SLSAGASAAEPLORSKFEENEDIVVMETKLKILEILQFILNVRLDYRISYLLSVFRKEFPVEVF PMQDSGADGTAPAFDSTTA! INLDRIGEQAEAMFGVGKTSSMLEVDDEGGRMFLRVLIELTMEDYAPL 1070 
«++ 960. ......970-----+-980......-990......-1000......1010......1020......1030......1040......1050......1060......1070......1080......1090 

40 rt a42 a43 ats ad 46 


VSGALQLLFREF SQRQEVLQAFKQVOLLVTSQDVDNYKQIKQDLDQLRS I VEKSELWVYK--GQGPDEPMDGA: YNYRVVKEILIRLSKLCVQESASVRKSRKQ 1200 


ERKKTEEGTSKPLKEEST 


VSGALQLLFREF SQRQEVLOAFKOVOLLVTSQDVDNYIKQOIKODLDOLRSI VEKSELWVYK--GQGPDETMDGAS -GE -NEHKKTEEGNNKPQKHESTSS YNYRVVKEILIRLSKLCVQESASVRKSRKQ 1210 
\QRAEVLOAPKOVOLLVSNOQODVDNYKOIKADLDOLRLTVEK NYENGEIGESQVKGGE ILSPVQDGTKKPQ-IDS RIVKEILIRLSKLCV! ~KKCRNQ 1204 
VSGALQLLFKEF SQRQEAMETFRKOQVQLLISAQDVENYKVIKSELDRLRTMVEKSELWVDKKGSGKGEEVEAGAAKDK- — ~KERPTDEEGFLEP GEKSS ENYQIVKGILERLNKMCGVG----EQMRKK 1192 
eeeeee 21100......2110......1120......1130......1140......1150......1160......1170......1180......1190......1200......1210......1220......- 
a46 a47 ass a49 ado a5t a52 a53 ad4 
QQRLLRNMGABAVVLELLOI PYEKAEDTKMOQE IMRLAHEFLONFCAGNHOQONOQALLEKHINLFLNPGILEAVTMOHIFM: EINERVVQHFVHECIETHGRNVOYIKFLOTIVKAEGKFIKKCQDMVMAELVNS 1347 
BQRLLKNMGAESVVLDLLOQI PYEKNDE-KMNEVMNLAETFLONFCRGNPQNOVLLEKELNLFLTPGLLEAETMRE IF’ RVVQHFVECIETHGREVEYLRFLOTIVKADGKYVKKCODMVMTELING 1338 
QQRLLKNMDAEKVMLDLLOQI PyYDKG-DAKMMEILRYTHOFLOKFCAGNPGNOQALLEKHLELFLTPGLLEAETMQHIFLNNYQLC: SEPVLQHFVELLATHGREVQYLDFLETVIKAEGKYVKKCQDMIMTELTNA 1328 
wee eee eee L240.....-1250......1260......1270......1280......1290......1300......1310......1320...... 1330..... +1340...... 1350...... 1360.... 


a4 55 adé aS8 62 


DCIPEVKIAYIN FPLVDICRACNNTSDRKHADSV 1474 
GEDVLVF YNDRASFQTLIQMMRSERDRMDENSP-LMYHIBLVELLAVCTEGKNVYTEIKCNSLLPLDDIVRVVTHEDCIPEVKIAY INFLNECYVDTEVEMKEIYT: FLVDICRACNNTSDRKHADSI 1483 
GEDVLIFYNDRASFPILLEMMCSERDRGDESGP-LAYHITLVELLAACTEGKNVYTE IKCNSLLPLDDIVRVVTHDDCI PEVKIAYVNFVNECYVDTEVEMKEIYT IWKLFENFPLVDMARVCNTTTDRKBADIF 1474 
GDDVVVF YNDKASLABLLDMMKAARDGVEDES P-LMYHISLVDLLAACAEGKENVYTE IKCTSLLPLEDVVSVV THEDCI TEVKMAYVNFVNECYVDTEVEMKEIYT IWTLFENFTLDMARVCSKR-EKRVADPT 1463 
-1410......1420.. «1430......1440......1450......1460......1470...... 1480......1490......1500. 


a63 
(III «+ «+ + == nee e eee e ee ee eee e eee eeeeeeee Ae ee eer een e eee nen eeneeneneeeeeenneeaaneneseenenenneneeee 


LEKYVTEIVMSIVITFFSSPFSDOSTTLOTROPVFVOLLOGVFRVY HCNWLMPSOKASVESCIRVLSDVAKSRAIAIPVYDLDSQVNNLFLK- 
LEKYVTEIVMSIVTTFFSSPFSDQOSTTLOTROPVFVOLLOGVFRVYECNWLMPSQKASVESCIRVLSDVAKSRAIAI PVDLDSQVNNLFLK- 
LEKCVTES IMNIVSGFFNSPFSDNSTSLOTHQPVFIQLLQSAFRI YNCTWPNPAQKASVESCIRTLAEVAKNRGIAIPVDLDSQVNTLFMK NMVQRAAMGWRLSARSGPRFKEALGGPAWDYRNIIEKLOQDV 1609 
LEKYVLSVVLDTINAFFSSPFSENSTSLOTHOTIVVOLLOSTTRLLECPWLOQOHKGSVEAC I RTLAMVAKGRAILLPMDLDAHI SSMLSSGASCAAAAQRNASSYKATTRAFPR--VIPTANQWDYKNIIEKLQDI 1598 
«ee++1510......1520......1530......1540......1550.....-1560......1570......1580......1590......1600......1610......1620......1630.....46: 


a63 a65 66 67 


HN-IVQKTAMNWRLSARNAAR-RDSVLAASRDYRNIIERLQDI 1607 
IVQKTAMNWRLSARNAAR-RDSVLAASRDYRNITERLQDI 1616 


VSALEDRLRPLVQAELSVLVDVL: 
VSALEDRLRPLVQAELSVLVDVLERPELLF PENTDARRKCESGGF I CKLIKETKQLLEENEEKLC I KVLQTLREMMTKDRGYGEKLISIDELDNAELPPAPDS 
VASLEEQFSPMMQAEFSVLVDVLYSPELLF PEGSDARIRC--GAPMSKLINETKKLMEK-EEKLCIKILQTLREMLEKKDSFVEE 
ITALEERLKPLVQAELSVLVDVLEWPELLFLEGSEAY QRCESGGFLSKLI QHTKDLMES-EEKLCIKVLRTLQOMLLEKTKYGDR 


-GNTLRKILLNRY 1703 
-GNQLREMLLOQNY 1694 


seeceees 1650...... 1660.....-1670......1680......1690......170 -1710......1720......1730... -1740.....-1750......1760......1770..... 
aés aé9 a70 a71 a72 


ESLTSFGNGPLSPGGPSKPGGGGGGPGSGSTSRGEMSLAEVQCELDKEGASNLVIDLIMNASSDRVFHES ILLAIALLEGGNTTI QUSFFCRLTEDKKSEKFFKVF ¥ DRMKVAQQEIKATVTVNTSD 
SAGGPGKPGGGGGGSGS ISRGEMSLAEVQCELDKEG: LVIDLIMNASSDRVFHESILLAIALLEGGNTTIQHSFFCRLTEDKKSEKFFKVFYDRMKVAQQEIKATVTVNTSD 
VNGHLSGAYSKTAQVGGSF SGQDSDKEMG~ ISMSDIQCLLDKEGASELVIDVIVNTKNDRIFSEGIFLGIALLEGGNTQTOYSF YQQLHEQKKSEKFFKVLY DRMKAAQKEIRSTVTVNTID 
=~=----AIAATQCRLDKEGATKLVCDLITSTKNEKIFQESIGLAIBLLDGGNTE I QKSFENLMMSDKKSERFFKVLEDRMKRAQQETKSTVAVNMND 
1800......1810......1820......1830......1840......1850......1860......1870......1880......1890......1900......1910.. 


a73 a74 a75 a76 


LGNKKKDDEVDRDAP- ADPD-DHY QSGEGTQATTDKAKDDLEMSAVITIMOPILRFLOLLCE DLONFLRCQNNKTNYNLVCETLOPLDCICGSTTG 2016 
LGNKKKDDEVDRDAP-SRKKAKEPTTQ ADPD-DHYQPGEGTQATADKAKDDLE VITIMQPILRFLQLLCE DLOQNFLRCQNNKTNYNLVCETLOFLDCICGSTTG 2024 
LGNKKRDDDNELMTSGPRMRVRDSTLELKEGMKGQLTEASSATSKAYCV MDPE IDIMCTGPEAGNTEEKSAEEVTMSPAIAIMQPILRFLOLLCE ELQNFLRNQNNKTNYNLVCETLOFLDCICGSTTG jo¢9 
LGSQPHED DLONFLRCQNNKTNYNLVCETLOFPLDIMCGSTTG i990 


weeee1921.....1930......1940......1950......1960......1970......1980. 19906 2000. 66 2010.2. 6 2020.66 6 20380... 1 ee -20GL.. eee eee 


81 
GLGLLGLY INEKNVALINQTLESLTEYCQGPCHENQNCIATHESNGIDIITALILNDINPLGKKRMDLVLELKNNASKLLLAIMESREDSENAERILYNMRPKELVEVIKKAYMQG-EVEFEDGENGEDGAASPRNV 2152 
GLGLLGLYI \VALINQTLESLTEYCQOGPCHENQNCIATE GKKRMDLVLELKNNASKLLLAIM: AERILYNMRPKELVEVIKKAYMQG-E 
GLGLLGLYI IVALVNQNLESLTEYCQGPCE GKYRMDLVLQOLKNNASKLLLAI ERILPNMRPRELVDVMKNAYNQGLECDEGDDEGGDDGVS 
GLGLLGLY INEDNVGLVIQTLETLTEYCOQGPCHENQTCIVTHESNGIDI ITALILNDISPLCKYRMDLVLQLKDNASKLLLALMESREDSENAERILISLRPQELVDVIKKAYLO 


acces 2060......2070......2080......2090......2100......2110......2120......2130......2140.....-2150......-2161.....2170......2180...... 
ass até a87 ass «89 a30 


VRLDRTMEQIVFPVPSI 
VRLDRTMEQIVPPVPSICEFLTK LRIYYTTERDEQGSKINDFFLR. 
DEGDEALKY YANHTAQIEIVRHEDRTMEQIVFPVPNICEYLTRESKCRVFNTTERDEQGSKVNDFFQQ 
KQOLQELLKPVKR I QEEEAEGI SSMLSLNNKQLSQMLKSSAPAQEEEEDPLAY YENETSQIE I VRODRSMEQIVFPVPGICOPLTEETKHRLFTTTEQDEQGSKVSDFFDQOSSFL: MEWOR 2186 
219D 6 oo 2200. 6 2ZTD oe 22ZD 6 oo ZZBD 6 oo Z2HD 6 oo  Z2GEO 6 6 oo ZZ2GCD 6 oe ZZTD 6 ooo 22B1 6 2290.6 1 ee  2300.....-2310......2320... 


«90 ad’ 92 ad. 94 


LAVLMNLLVAFFYPFKGVRG-GTLEPHWSGLLWTAMLISLAIVIALPKPH 2331 
KGVRG-GTLEPEWSGLLWTAMLISLAIVIALPKPH 2339 

2285 

KLRSMPLIYWFSRRMTLWGSISFNLAVFINIIIAPFYPYMEGASTGVLDSPLISLLF-WILICFSIAALFTKRY 2259 


++ +2330.....- 2340...... 2350...... 2360...... 2370.... «2380...... 2390....... 


©2015 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 5 | Sequence alignment of selected InsP3R channels. 


rInsP3R1, Rattus norvegicus (GI accession 17380349); hInsP3R1, Homo 
sapiens (GI accession 519668682); hInsP3R2, Homo sapiens (GI accession 
259016258); hInsP3R3, Homo sapiens (GI accession 209572633); the primary 
sequence numbering includes the first methionine. The numbering of 
residues is given below the sequences, secondary structure elements are 


ARTICLE 


indicated above the sequences and colour-coded in correspondence to the 
domains shown in Fig. 1b; dashed lines indicate regions that were not 
sufficiently resolved to be modelled. Given the enormous size of InsP3R 
proteins, the full-length sequence alignment was divided into two panels: 
sequence alignment for the transmembrane domains is shown in Extended 
Data Fig. 7a (note, overlap at the loop between the helices «90 and «91). 
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Extended Data Figure 6 | Representative cryo-EM densities. a-c, The cryo- _at Phe2586 within the ion conduction pathway (c), view from the cytosol along 
EM densities for some regions are shown overlaid with the corresponding the four-fold axis, colour-coded by subunit. 
model: cytosolic helices (a); transmembrane helices (b); the constriction point 
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Extended Data Figure 7 | Structural conservation of the pore among 
tetrameric cation channels. a, Alignment of the channel-forming domains; 
residues discussed in the text are labelled as following: blue circles, negatively 
charged residues; red circles, positively charged His2541; yellow highlight, 
N-glycosylation sites (Asn2476 and Asn2504); green highlight, selectivity filter; 
blue highlight, conserved Gly2587; green box, hydrophobic constriction region 


and Phe2586 shown in green; dark blue, Zn”” -finger-like residues Cys2611/ 
Cys2614 and His2631/His2636; blue box, tetramerization region. b, Structural 
comparison of pore-forming elements of InsP3R1, RyR1, K,1.2-2.1, NayRh, 
TRPVI1 (PDB accessions: 3J8H, 2R9R, 4DXW and 3J5P, respectively). Note 
substantial overlap between structures. 
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Extended Data Figure 8 | Comparison of InsP;R1 and RyR1 structures. 

a, Two opposing subunits of InsP;R1 and RyR1 (PDB accession 3J8H) are 
viewed along the membrane plane. One InsP3R1 subunit is colour-coded 

by domain (left). Structurally consistent domains in one RyR1 subunit are 
colour-coded using InsP3R1 domain architecture. Domains of RyR1 not in 
common are shown in grey. b, TMDs of RyRI1 (tan) and InsP3;R1 (coloured by 
subunit) are superimposed using Chimera’s Matchmaker and viewed from the 


cytosol (left) and lumen (right). The r.m.s.d. between 80 atom pairs is 2.0 A. For 
clarity, P-helices are not shown. c, Structural comparison of the selectivity 
filter (red) in InsP3R1 with that in some tetrameric cation channels: RyR1, 
TRPV1, TRPA1, Ca,Ab (PDB accessions: 3J8H, 3J5Q, 3J9P and 4MS82, 
respectively). Two opposing subunits are shown; TM5 and TM6 helices are 
coloured tan, P-helices are in green. 
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Extended Data Figure 9 | The putative cytosolic Ca’* sensor in InsP3R1. 
a, Cut-open side view of the cryo-EM density map of InsP3R1 is shown with the 
structures of the Ca** sensor region for InsP3R1 (residues 1952-2270) and 
RyR1 (residues 3877-4251; PDB accession 3J8H); the EF-hand of RyR1 
includes residues 4071-4130. b, Close-up view of the overlapped Ca’* sensor 
structures for InsP3R1 (colour-coded by domain as in Fig. 1) and RyR1 (tan). 
The conserved Glu2101/InsP3R1 and Glu4032/RyR1 are shown in red. 
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c, Sequence alignment of the predicted Ca** sensor regions comprising the 
conserved Glu2101/InsP3R1 and Glu4032/RyR1 (red) (rInsP3R1/GI1 
code:17380349; hInsP3R1/GI code: 519668682; hInsP3R2/GI code: 259016258; 
hInsP3R3/GI code: 209572633; RyR1/GI code: 134134; RyR2/GI code: 
308153559; RyR3/GI code: 75074791); green highlight denotes completely 
conserved residues; yellow highlight denotes identical residues. 


ARTICLE 


Q2700 K2701 


N396 


Extended Data Figure 10 | Structural coupling in the CTD. a, A bundle of the surfaces are colour-coded according to electrostatic charges calculated for 
the CTD helices is viewed perpendicular to the channel axis at the position of _ the model: blue denotes positive charges; red denotes negative charges. Shown 
a predicted inter-subunit saltbridge between residues Gln2700 and Lys2701. is a side view with the cytoplasmic side facing up. c, Close-up view of the 
The right panel shows the predicted salt bridge normal to the channel axis. interface between the CTD and B-TF2 domains of two neighbouring subunits; 
Helices are colour-coded per subunit. b, Structures of the CTD and B-TF2 viewed from cytosol perpendicular to the membrane plane. 

domains are superimposed on their corresponding cryo-EM densities and 
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Accreting protoplanets in the LkCa 15 transition disk 


S. Sallum!, K. B. Follette?, J. A. Eisner!, L. M. Close!, P. Hinz!, K. Kratter!, J. Males!, A. Skemer!, B. Macintosh?, P. Tuthill’, 
V. Bailey!, D. Defrére!, K. Morzinski', T. Rodigas*, E. Spalding!, A. Vaz! & A. J. Weinberger 


Exoplanet detections have revolutionized astronomy, offering new 
insights into solar system architecture and planet demographics. 
While nearly 1,900 exoplanets have now been discovered and 
confirmed’, none are still in the process of formation. Transition 
disks, protoplanetary disks with inner clearings”* best explained 
by the influence of accreting planets’, are natural laboratories for 
the study of planet formation. Some transition disks show evidence 
for the presence of young planets in the form of disk asymmetries’ 
or infrared sources detected within their clearings, as in the case 
of LkCa 15 (refs 8, 9). Attempts to observe directly signatures of 
accretion onto protoplanets have hitherto proven unsuccessful)”. 
Here we report adaptive optics observations of LkCa 15 that probe 
within the disk clearing. With accurate source positions over 
multiple epochs spanning 2009-2015, we infer the presence of 
multiple companions on Keplerian orbits. We directly detect Ha 
emission from the innermost companion, LkCa 15 b, evincing hot 
(about 10,000 kelvin) gas falling deep into the potential well of an 
accreting protoplanet. 

We observed LkCa 15 using the high-contrast imaging technique of 
non-redundant masking (NRM)"", at the Large Binocular Telescope (LBT) 
in Ks (A,=2.16 um) and L’ (A,=3.7 um; see Extended Data Table 1). 
We detect two components, LkCa 15 b and c, in both bands, with con- 
sistent positions across wavelength given the uncertainties (see Table 1, 
Extended Data Fig. 1). We detect a faint, third component, LkCa 15 d, at 
L’ only. Since d is significantly fainter than b and c, and not detected at Ks, 
we focus on the other two sources in the following analysis, but include 
discussion of the putative third companion where relevant. 


Table 1 | Model and experimental results 


We also observed LkCa 15 in Ha (A, = 655.8 nm) using the Magellan 
Adaptive Optics System (MagAO) in Simultaneous Differential 
Imaging (SDI)'”!3 mode (see Methods). We detect LkCa 15 b in these 
data, at a signal-to-noise of 6.4 and a position that agrees with the LBT 
observations (see Table 1, Extended Data Figs 2-5, Extended Data 
Table 2). LkCa 15 c was not detected in Ha, perhaps owing to higher 
extinction along the line of sight or lower accretion rates at the time of 
the observations. Both b and c lie well within the disk clearing (Fig. 1), 
which extends to a stellocentric radius of 56 au'4. 

We compare the positions of LkCa 15 b and ¢ to the infrared 
signal seen in 2009-2010 NRM observations®. As shown in Fig. 2, 
orbital fits (fixed to the outer disk plane: inclination i= 50°, position 
angle 0= 150°)’ suggest distinct orbits, with b moving faster (semi- 
major axis, a= 14.7 +2.1 au) thanc (a=18.6 +2.5 au). Taking the 
semimajor axis uncertainties into account and requiring that these 
orbits be stable, b and c must have masses lower than 5— 10 times 
that of Jupiter (M;)'*'°, with masses over 5 My allowed only in the 
case of a 2:1 resonance. For completeness, we performed a series 
of four-body simulations to show that stable orbital solutions exist 
including LkCa 15 d, with three planet masses <0.5My (see Methods, 
Extended Data Figs 6, 7) and higher masses for b and c allowed with 
a less massive d. 

We calculate infrared fluxes for LkCa 15 b and c, and compare them 
to circumplanetary accretion disk models'”'* and hot-start!? models 
of sub-stellar mass companions shortly after accretion has ceased (see 
Fig. 3). From the LkCa 15 A magnitudes (2MASS Ks =8.16”° and IRAC 
M3,6= 7.617"), we derive fluxes of 1.40.7 mJy at Ks and 2.5+1.2mJy 


Component Date Instrument A PA* (°) st(mas) At (mag) M§ (mag) Oe ye af] (AU) 
odel fit results 

LkCa 15 b 15 Nov 2014 MagAO Ha=656.3nm -104+3 93+8 5.2+0.3 15.8+0.3 3x 10-6 47434 

LkCa 15b 5-7 Feb 2015 LBT Ks=2.18 um —86+7e 125473 60+69 14.2438 10-5 A7 +34 

LkCa 15b 15 Dec 2014 LBT L’=3.8 um 100+% 106+%3 54493 13.6434 10-5 A7 +34 

LkCa 15 5-7 Feb 2015 LBT Ks=2.18 um —48 +73 85412 5.5488 3.7483 10-5 8.6455 

LkCa 15c¢ 15 Dec 2014 LBT L’=3.8 um —44+}6 68+73 48493 29492 10-5 8.6433 

LkCa 15d 15 Dec 2014 LBT L’=3.8 um 14433 87432 5.9424 14.1423 Bx 107° 8.0487 

LBT joint fit results 

LkCa 15b Dec 2014-Feb 2015 LBT Ks+L’ ~98 +15 95492 AKs=6.0+05 142+0.5 10-5 47434 
AL’=5.0+0.5 13.2+0.5 

LkCa 15c¢ Dec 2014-Feb 2015 LBT Ks+L’ 42435 80418 AKs=5.5+0.5 13.7+0.5 10-5 8.6435 
AL’=5.0+0.5 13.2+0.5 


«Position angle measured east of north. 
{Stellocentric separation. 

t+Contrast. 

§Absolute magnitude. 

| | Planet mass times accretion rate. 
qBest fit orbital semi-major axis. 
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Figure 1 | Composite Ha, Ks, and L’ image. a, The coloured image shows 
Ha (blue), Ks (green), and L’ (red) detections at the same scale as VLA 
millimetre observations”? (greyscale). b, Zoomed in composite image of 
LBT and Magellan observations, with b, c, and d marked. 


at L’ for b, and 2.3+1.1 mJy at Ks and 2.5+1.2 mJy at L’ for c. These 
are consistent with accretion disks having inner radii Ri, = 2Ry and 
planet mass times accretion rate M Ml ~10-° My; yr '. However, chang- 
ing Rin affects both the total disk flux and its colour. The large uncertain- 
ties on fluxes and colours allow us to constrain Rj, only to within a 
factor of ~2, translating to a factor of ~2-3 uncertainty in M M (for 
example, a Rin=1R), MM 23x 10° Mj? yr! disk can also reproduce 
the observations). 

While the hot-start model shown in Fig. 3 can approximately pro- 
duce the Ks and L’ emission for b and c, the observations are best 
explained by an accretion disk model. The hot start model can only 
match a previously established 1.55 1m upper limit on the contrast of 
the structure within the disk gap (AH = 7.2 mag)’ if the extinction is 
significantly higher than inferred towards the star. Moreover, even a 
highly extincted hot-start model cannot reproduce the strong emission 
at 4.7 um (contrast of AM = 3.5 mag)’. Emission from an accretion disk 
increases from L’ to M band, while the hot-start model produces little 
M band emission. Finally, a cooling photosphere produces no Ha emis- 
sion, firmly ruling out the hot-start model as the source of LkCa 15 b. 

Since LkCa 15 b is detected at Ha, an accretion tracer’?-*4, its nature 
as an accreting protoplanet is clear. LkCa 15 b’s Ha contrast, corrected 
for As Ha excess and assuming equal extinction to A (Ar=0.75 mag”), 
corresponds toa line flux of ~6 x 10~°Lo. Assuming similar accretion 
luminosity (Lac) scalings as low-mass T Tauri stars!*> gives 
Lacc¥4 X 10 ~“* Lo, yielding MpM ~3 x 10-°M/ yr! fora 1.6R; planet” 
(R; Jupiter radius). Previous observations showed that low-mass, 
accreting objects may emit a higher fraction of accretion luminosity at 
Ha”; assuming similar accretion scalings as T Tauri stars may 
overestimate L,.,. Extinction towards b is also uncertain; while we 
assume equal extinction to A and b, localized extinction can alter the 
numbers quoted above. While the uncertainties are large, this M yM is 
consistent with that estimated from the infrared fluxes. 


a b 
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M, M acc ~ 
10-5 M2 yr 


5 x 10°§ Mj yr" 


0.1 


ql 10 
A (um) 
Figure 3 | Spectral energy distributions. Symbols indicate fluxes for LkCa 


15 b (circles), c (squares), and d (diamonds), showing Ha (dark blue), Ks 
(green), and L’ (red). The light and dark blue arrows mark previously- 
published H-band? and 30 642 nm upper limits for LkCa 15 b, respectively. 
The lines show accretion disk and hot-start models. The disk models are 
simple combinations of blackbody spectra’, a suitable approximation for 
the case of a cool (T'< 1,500 K) stellar atmosphere where dust opacity 
dominates. The M,M calculated from the Ha flux agrees with that inferred 
from the infrared measurements (see text). 


Previous investigators posited a single protoplanet in LkCa 15, 
accreting material from its co-orbital surroundings®. While the sem- 
imajor axis uncertainties do not formally rule out b and c (and d, see 
Extended Data Fig. 6) being co-orbital, physical arguments show 
that they cannot be gravitationally bound. The size of the previously 
reported emission (several av) is larger than a Hill radius (~1.8 au for 
a 10 My planet (Mj, Jupiter mass) orbiting a 1 Mo star at 10 au), and 
much larger than the maximum possible size of a circumplanetary disk 
(~1/3 the Hill radius””). Thus the sources cannot be part of a bound, 
accreting system, and an alternative scenario is required to explain the 
observations. 

We argue further that it is difficult to explain LkCa 15 b and cc (and 
d) with an orbiting clump of gravitationally unbound dust within 
the disk gap, emitting thermally or in scattered light. At a distance 
of ~10 Au, neither LkCa 15 A nor a companion with a contrast of ~5 
magnitudes can heat dust sufficiently to emit at 2-4 um. Assuming 
isotropic single scattering, we calculate that an optically thin spher- 
ical clump of dust, perhaps resulting from a recent planetesimal col- 
lision, could produce the contrast observed at a single wavelength. 


50 


Position angle (°) 
Position angle (°) 
i=} 


I 
a 
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Time (years) 


Figure 2 | Position evolution. a, LkCa 15 b position angle and separation 
(inset) evolution, showing Ha (blue), Ks (green), and L’ (red). The earliest 
three points indicate previous observations’; others show fits to our data. 
Coloured and black 1o error bars are from a nonlinear algorithm and a 
grid, respectively (see Methods). The yellow shading spans the 1o allowed 


Time (years) 


a (mas) 


parameters from orbital fitting. Solid and dotted curves show stable 
orbits for 0.5 My and 1.0 Mj planets, respectively. b, Same as a, for LkCa 
15 c. ¢, Stable orbits for 0.5 My (solid) and 1.0 Mj (dotted) planets. 

mas, milliarcsecond. 


19 NOVEMBER 2015 | VOL 527 | NATURE | 343 


© 2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


However, observing this clump before it sheared out would be a priori 
unlikely, since the viscous timescale at 10 au is just ~3% the age of the 
system. 

Observations argue strongly against this explanation as well. 
Scattering cannot cause increasing emission from H to M band’, since 
dust opacity decreases with increasing wavelength. Furthermore, since 
dust opacity is equal between Ha and the nearby continuum, scatter- 
ing signals have equal contrast in both narrowband filters. Scaling the 
continuum image by the LkCa 15 A Ha-to-continuum flux ratio and 
subtracting it from the Ha image should only lead to an Ha detection 
if scattering is not the emission mechanism. Indeed, this yields a LkCa 
15 b detection with signal-to-noise of 4.8. While the Wollaston beam 
splitter in MagAO’s SDI mode could lead to contamination by polarized 
light, the visible polarized scattered light intensity at b’s position is less 
than ~7% the Ha source flux”*. It could not cause the Ha detection. 
This leaves the multiple-planet scenario as the most natural explanation 
for the data. 

Both the infrared and Ha observations show that we are unambig- 
uously witnessing planet formation in LkCa 15. The data offer evi- 
dence that giant protoplanets undergo a period of high infrared and 
Ha luminosity during their accretion phase. In the near future, ALMA’s 
sensitivity and angular resolution should enable us to detect sub- 
millimetre emission from circumplanetary disks”’. Additionally, while 
the LBT data published here were taken in single-aperture mode (base- 
lines up to ~8 m), non-redundant masking using the co-phased LBTI 
will provide 23-m baselines, allowing us to place tight constraints on 
the companion orbits and to resolve structure at smaller separations. 
Continued monitoring of accretion tracers from LkCa 15 b will probe 
whether the accretion is steady or stochastic. This young system pro- 
vides the first opportunity to study planet formation and disk—planet 
interactions directly. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

LBT observations and data reduction. We observed LkCa 15 using non- 
redundant masking (NRM)!! with LBTI/LMIRCam*°*! in December 2014 and 
February 2015. NRM transforms a conventional telescope into an interferometric 
array through the use of a pupil-plane mask, providing better point-spread func- 
tion (PSF) characterization and resolving angular scales even within A/D. We used 
LMIRCam?s 12-hole mask in single-aperture mode, yielding 1.4—7.0 m baselines. 
We broke up the observations into “visits,” each consisting of an identical set of inte- 
grations on LkCa 15 and an unresolved calibrator star (see Extended Data Table 1). 
We used three calibrators to lessen the possibility of contamination by a binary 
calibrator, and included one calibrator, GM Aur, from those observed previously 
at Keck®. We let the sky rotate throughout the observations, facilitating calibration 
of quasi-static speckles. At Ks and L’ we observed LkCa 15 at parallactic angles 
between —37° and 65°, and —65° and 65°, respectively. 

The NRM images show the interference fringes formed by the mask, the Fourier 
transform of which yields complex visibilities. Sampling the complex visibilities, 
we calculated squared visibilities (power versus baseline) and closure phases (sums 
of phases around three baselines forming a triangle). Closure phases eliminate 
atmospheric phase errors, leaving behind the sum of the intrinsic source phases. 
The LBT raw closure phase scatter was ~3°, significantly lower than the uncertain- 
ties in previous NRM observations® (~4°). 

For each closing triangle, we fitted a polynomial to all calibrator closure phases 

as a function of time. We sampled the polynomial at the time of each target obser- 
vation and subtracted it from each target closure phase. We calibrated using a 
variety of functions; of these, polynomials up to 2nd order in time provided the 
lowest-scatter calibrated closure phases, with standard deviations of 1.7° at Ks and 
1.9° at L’. We calibrated the squared visibilities similarly, dividing by the calibrator 
rather than subtracting. We calibrated the mask baselines using the observed power 
spectra and knowledge of the filter bandpass and plate scale*”. 
LBT image reconstruction, model fitting, and parameter error estimation. We 
fitted models directly to kernel phases***4, linearly independent combinations 
of closure phases, to search for companions. We modelled the central star as a 
delta function and each companion as another delta function located a distance s 
from the star, at position angle PA, and with contrast A. We sampled the synthetic 
complex visibilities at the same baselines and parallactic angles as the data, and per- 
formed a grid fit, using a Ay’ to determine our parameter confidence intervals. Due 
to a known degeneracy between companion separation and contrast*’, brighter 
companions at smaller separations provide equally good fits as those fainter and 
farther out. We thus performed fits to individual wavelengths to verify that the 
positions of b and c agreed across wavelength, then calculated a best fit where 
the companions coincided at Ks and L’ (see Table 1). The model grids in this 
study required ~50,000 CPU hours to generate, but were computed in a reasonable 
amount of time using the University of Arizona's El Gato supercomputer. 

We also reconstructed images from the closure phases. To produce cleaner 

images, we re-calibrated the closure phases towards the best-fit Ks + L’ model 
using an optimized calibrator weighting technique applied in previous NRM 
studies®. This calibration is similar to the locally optimized calibration of images 
(LOCI)** technique applied in direct imaging. Since this scheme can remove signal 
and underestimate errors, we applied it only to produce images (see Extended 
Data Fig. 1), using the polynomial calibration to estimate companion parame- 
ters. As a consistency check, we reconstructed images using both the BiSpectrum 
Maximum Entropy Method (BSMEM)*’ and the Monte-Carlo MArkov Chain 
IMager algorithm (MACIM)**. The companion positions agree between the two 
algorithms, although BSMEM produces more compact emission towards each 
component. BSMEM has been shown to provide the most faithful image recon- 
struction of any available algorithms in blind tests®. 
Companion parameter error estimation for previously-published Keck data. 
Orbital fitting required parameter errors for the previously published® Keck 
observations and the LBT observations to be consistently estimated. The pub- 
lished errors for the 2009-2010 companion parameters were generated using the 
nonlinear algorithm MPFIT“. While nonlinear fitters are often employed for com- 
putational expediency, the Levenberg-Marquardt algorithm can easily fall into a 
local minimum and underestimate parameter errors. The LBT grid y surfaces 
show local minima for both two- and three-companion fits, rendering MPFIT 
unreliable unless the starting parameters were very close to the global minimum. 
We compared MPFIT and grid-based parameter errors for the LBT data, and found 
that MPFIT significantly underestimated the errors (Fig. 2). 

To create a “typical” error bar for each Keck companion, we estimated the error 
bar dependence on contrast using the LBT fits. Errors increased with decreasing 
companion flux, which we parameterized as a square root dependence. For a given 
Keck companion we thus scaled our LBT errors by the square root of the LBT- 
to-Keck flux ratio. We inflated the Keck error bars by a factor of 1.3, the ratio of the 
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uncalibrated closure phase scatter in the Keck data (~4°) to that for the LBT data 
(~3°). We capped the separation upper limits at 3A/D, where D is Keck’s telescope 
diameter, 10 m, since the largest LBT upper limit was at nearly 3A/D, and compan- 
ions at those distances are no longer subject to the separation-contrast degeneracy. 
MagAO data reduction and analysis. We observed LkCa 15 on November 15 
and 22, 2014, as part of the Giant Accreting Protoplanet Survey (GAPplanetS), 
a visible-wavelength survey of bright transition disks. GAPplanetS stars are imaged 
simultaneously in Ha (0.656 um, AA=6 nm) and the nearby stellar continuum 
(0.642 um, AA=6 nm) with the 585-actuator Magellan Adaptive Optics systems 
SDI camera!*!?, The continuum channel provides a sensitive, simultaneous 
probe of the stellar PSE, allowing for effective removal of residual starlight and 
isolation of Ha emitting sources!*“%, The observations used new single-substrate 
narrowband Ha and continuum filters, a significant improvement over the previ- 
ous VisAO SDI filters, which suffered from ghost images”. 

Seeing during the November 15 observations was better than the site median 
(0.56 + 0.06”), winds were low (3.6 + 0.9 mph), and humidity was typical of the 
season (37.0 + 2.8%). Strehl ratio was low (<10%), and difficult to measure 
meaningfully. We characterized image quality using the stellar full-width at half- 
maximum (FWHM), 0.07” (at 0.65 um over 30 s integrations), a significant 
improvement over the seeing. We collected 316 30-s closed-AO-loop images, with 
a total of 156 min of integration time and 48.6° of sky rotation. We selected the 
149 LkCa 15 images with the lowest residual wavefront error (~50%), equivalent 
to 74.5 min of exposure time. This image subset had 47.6° of sky rotation, with the 
rotational space well sampled. 

The November 22 data were not of sufficient quality to recover LkCa 15 b, 
due to lower sky rotation (27.0°), shorter total integration (91 min), and shallower 
individual exposures (15 s). Injected positive planets with the same separation as 
LkCa 15 b were only recoverable with SNR > 3 at contrasts > 5 x 10~* (nearly an 
order of magnitude brighter than the measured November 15 LkCa 15 b contrast). 
For this reason, we discuss only the November 15 data set in the rest of the paper. 

Images were first bias-subtracted, registered, and aligned via cross-correlation. 
The Ha flat field image showed very little non-uniformity across the field (<1%), 
so a flat field was not applied. We masked CCD dust spots visible in the flat field 
wherever they affected the image throughput by more than 2%. 

We processed the aligned data using angular differential imaging (ADI““), com- 
paring the “classical” method of using a single median PSF for all images (cADI*) 
to the Karhunen-Loeve image processing (KLIP“*) algorithm, which calculates 
a least-squares optimum PSF for each image. LkCa 15 b was detected in the Ha 
channel via both methods, as shown in Extended Data Fig. 2. The planet was not 
detected in continuum with either method, so continuum images were used as a 
probe of PSF residuals and scattered light emission from the inner disk. Subtraction 
of the processed continuum images from the Ha images (“ASDI”) left behind only 
true Ha emission’”. 

MagAO cADI reductions. We constructed the stellar PSF by median combining 
images in 0.5° rotational bins and then median combining again to produce a 
PSF evenly sampled in rotational space. We subtracted the stellar PSF from the 
individual images, rotated them to a common on-sky orientation and combined 
them. Given the small separation between LkCa 15 A and b, the planet moved by 
only 1.5 FWHM over the course of the observations, resulting in self-subtraction 
and decreasing the FWHM of the processed planet PSF to 4-5 pixels in azimuth. 

MagAO KLIP-ADI reductions. KLIP reductions were carried out using a well- 
tested custom IDL code’”. To optimize reduction parameters, we maximized the 
signal to noise of injected planets (with the same separation and contrast as LkCa 
15 b) inserted after using a negative planet to erase the LkCa 15 b signal. Planets 
were placed at position angles distant from known artefacts, and east or west of 
the star to avoid the noisier north/south region of the PSE, corresponding to the 
wind direction during the observations. 

To limit self-subtraction, the library from which KLIP builds the stellar PSF 
is limited to images where a planet would have rotated away from its original 
position. We explored the size of this exclusion region (“rotational mask”) sys- 
tematically through fake planet injection, and found that a 5° mask (~1 pixel at 
r=11 pixels) produced the highest signal-to-noise recoveries of injected planets. 
Given the stellar FWHM of 0.07, this resulted in azimuthal self-subtraction, with 
a processed planetary PSF of 2 pixels in azimuth. 

Noise in the KLIP processed images was mostly Gaussian when images were 
divided into several independently-optimized radial zones, indicating efficient 
removal of speckles. Dividing these zones azimuthally provided no additional 
advantage, and the final KLIP reductions shown in Extended Data Fig. 2 reflect a 
PSF divided into 50-pixel (0.4”) annuli. Removal of the median PSF radial profile 
for the entire image set aided significantly in attenuating the stellar halo, improv- 
ing the ability of the KLIP algorithm to match residual speckles and enhancing 
contrast close to the star. 
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MagAO LkCa 15 b photometry and astrometry. We estimated photometry and 
astrometry by minimizing residuals after injecting a negative planet at the location 
of LkCa 15 b. The cube of registered and aligned Ha channel images was scaled 
by the chosen contrast value, multiplied by —1, and injected into the raw images 
before KLIP processing. Using the full Ha image cube rather than its median 
combination simulated variability of the PSF between images. 

We generated error bars by injecting false positive planets with similar sepa- 

rations and contrasts to LkCa 15 b after using a negative planet to eliminate the 
true signal. Planets were placed at position angles away from the wind direction, 
and spaced by at least 75% of the measured stellar FWHM. We computed the 
centroid and peak pixel using a 5-pixel aperture around each planet, and assigned 
the standard deviations in recovered flux and position as our 1¢ photometric and 
astrometric uncertainties, respectively (see Extended Data Table 2 and Extended 
Data Fig. 3). 
Signal-to-noise of the MagAO Ha detection. To create signal-to-noise ratio 
(SNR) maps, we calculated a radial noise profile using the standard deviation of 
1-pixel-wide annuli and divided it into the raw images. In the raw maps, LkCa 
15 b has SNR 3-4. Smoothing by a Gaussian with a 2-pixel FWHM maximized 
the SNR of injected fake planets, so we applied this smoothing to the final science 
images, resulting in peak SNRs of 4.4 and 6.8 in the KLIP Ha and ASDI images, 
respectively. However, directly-imaged exoplanets at small separations suffer from 
small number statistical effects*®. The underlying speckle distribution is difficult 
to probe given the small number of independently sampled noise regions. In an 
annulus at the distance of LkCa 15 b (1.3 FWHM), seven noise regions exist, lead- 
ing to corrected*® SNRs of 4.1 and 6.4 for the Ha and ASDI images, respectively. 
The ASDI detection corresponds to a false positive probability of 3 x 10-4 using 
the Student's t-distribution with 6 degrees of freedom. 

Comparing the LkCa 15 b SNR to the distribution of values in the ASDI SNR 
map (Extended Data Fig. 4), shows that it is a clear outlier. Comparison of the 
peak pixel in an aperture centred on b to those in the surrounding noise apertures 
(Extended Data Figs 4, 5) further demonstrates b’s statistical significance. 

In addition to the high SNR, low false positive fraction, and the statistics pre- 
sented in Extended Data Fig. 4, the Ha detection is significant because it occurs 
at the same location as the independent LBT detection. This further reduces the 
probability of a false positive detection in the MagAO data, since speckles have 
no preferred location. 

Fidelity of the MagAO LkCa 15 b detection. Neither the existence of LkCa 15 b 
nor its derived parameters are dependent on our choice to include only the top 
50% of raw images. The planet appears at the same location and with the same 
approximate brightness when processing all 316 images, as well as only the top 25% 
of images. An excess with SNR >3 appears at LkCa 15 b’s location with a wide range 
of KLIP zone geometries and rotational masks, when any number of KL modes 
from 2 to 100+ are removed, and whether or not the median radial profile of the 
PSF is subtracted before processing. 

Limits on MagAO LkCa 15 b continuum flux. We used simulated planet detec- 
tions to place an upper limit on LkCa 15 b’s continuum flux. We injected planets 
into the raw continuum channel images with a range of contrasts and at positions 
near LkCa 15 b. We then measured the SNR of each simulated detection to deter- 
mine the confidence at which we could detect a given contrast. As above, we apply 
a small number statistical correction*® to the SNR of each recovered planet. The 
simulations suggest that we would have detected an excess with a corrected SNR 
of 3 (false positive fraction of 10-7) for a continuum source with contrast greater 
than 5 x 107+. Since LkCa 15 A is 1.8 times brighter at Ha than continuum, this 
corresponds to an Ha-to-continuum-flux ratio lower limit of 2.7. 

Limits on MagAO LkCa 15 c Ha contrast. We established limits on the LkCa 15 c 
Ha contrast using false planet injections, first using a negative planet to eliminate 
the LkCa 15 b signal. We injected planets with a range of contrasts at positions 
sampling the LBT error ellipse for LkCa 15 c. At position angles between —40 and 
—52°, several 2-2.50 peaks near c’s location boost the SNRs for recovered planets. 
Here, we can detect contrasts down to 2 x 10~? with corrected** SNRs of 3 (false 
positive fraction of 2 x 10-7). Position angles greater than — 40° approach a nois- 
ier region of the PSF, leading to decreased sensitivity; here contrasts of 6 x 10-3 
are required for adjusted signal-to-noise ratios of 3. We cannot reject or confirm 
accretion onto LkCa 15 c below 6 x 10? contrast (AHa =5.6 mag) with the current 
data set. This improves upon previous spectro-astrometric observations, which 
yielded a contrast limit of AHa = 3.4 mag for a position angle near LkCa 15 c’”. 
Stability analysis with LkCa 15 d. We ran a series of orbit integrations to 
demonstrate that stable solutions exist for b, c, and d at separations within the 
1o semimajor axis error bars (see Extended Data Figs 6, 7). We used the publicly 


available Swifter package**-specifically, the symplectic integrator, Sy MBA™, which 
switches to a Burlisch-Stoer algorithm for planetary close approaches. We also ran 
comparison integrations with the Gauss Radau 15th order integrator and found 
comparable results, with minimum energy conservation of 1 part in 10” over a 
10 Myr integration. 

We required all orbits to be nearly co-planar, with a random scatter <1°, and 
assigned each planet a random eccentricity below 10~*. To assess stability we inte- 
grated three different random phase combinations for 10 Myr. We found stable three 
body solutions out to 1-2 Myr with semi-major axes of a) = 12.7 AU, ac-= 18.6 AU, 
aq= 24.7 au. To ensure stability out to 10 Myr with orbits in the 1o errors requires 
that all planets be <0.5 Mj. A wider range of orbits is allowed if d’s mass is 
decreased further. These constraints are in line with previous large numerical 
studies of equally spaced (in Rim), equal-mass planets*!. Note for planets b and c, 
there are possible resonant configurations within the predicted period ranges, 
which would admit somewhat higher masses. 
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Extended Data Figure 1 | Image reconstructions. a—d, Images 
reconstructed from closure phases, showing Ks polynomial (a) and LOCI- 
like (b) calibrations, and L’ polynomial (c) and LOCI-like (d) calibrations. 
Both calibrations yielded reconstructed images with at least two distinct 
components. The LOCI-like calibration moved each companion within the 
position errors derived from the grid y’ surface. 
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Extended Data Figure 2 | KLIP and ADI Ha SNR maps. a-c, Final KLIP The presence of dark holes in the maps suggests that we are speckle- 


SNR maps for Ha (a), continuum (b) and the difference between the dominated out to the AO control radius at r= 20 pixels (white, dashed 

two (ASDI, c). d-f, Final cADI SNR maps in the same order. Dividing circles). LkCa 15 b’s separation is 11.6 pixels. The yellow keystones indicate 
by the radial noise profiles to create these maps should normalize the the 20 range of allowed astrometry for the KLIP ASDI point source (upper 
noise distribution at all radii within the speckle-dominated regime. right) based on negative simulated planet injection. 
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Extended Data Figure 3 | False positive planet SNR maps. a, LkCa 15 11 pixels and contrast of 8 x 10-3. Recovered parameters for these 
final ASDI SNR map. b, ASDI SNR map with LkCa 15 b removed. planets are given in Extended Data Table 2 and were used to determine 
c-h, ASDI SNR maps of false positive planets injected at a radius of lo astrometric and photometric uncertainties. 
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Extended Data Figure 4 | Ha detection noise statistics. a, Histogram 
of noise (non-planet) pixel values in the SNR map within the speckle 
dominated regime (black line) compared to a Normal distribution (red 
line). The black arrow denotes the location of the peak SNR value for LkCa 
15 b. b, Histogram of the peak values in all noise apertures (see Extended 
Data Fig. 5) within the control radius (black line) compared to a Normal 
distribution (red line). The black arrow shows the peak pixel value in the 
LkCa 15 b aperture. 
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Extended Data Figure 5 | Noise apertures. Noise apertures (black circles) 
surrounding LkCa 15 A used to calculate the statistics presented in 
Extended Data Fig. 4. Colour indicates SNR. 
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Extended Data Figure 6 | LkCa 15 d position angle and separation 
versus time. Evolution of position angle and separation (inset) for LkCa 
15 d. Green and red points indicate Ks and L’ data, respectively. In both 
panels, the earliest three points correspond to previously published Keck 
observations’, and the most recent points show best fits to our data. The 
coloured error bars are derived using the nonlinear algorithm MPFIT, 
which significantly underestimates the parameter errors compared to the 
more robust grid, Ay? (black error bars; see Methods). The yellow shaded 
region spans the position angles and separations allowed at 1o by the 
multi-epoch observations, which have semi-major axes between 12.6 and 
24.7 au. Solid curves show the best-fit orbit (18.0 au), and dashed curves 
show an orbit (24.7 av) that is stable for a 0.5 Mj planet exterior to LkCa 
15 b and c. Lower mass planets or resonant configurations permit stable 
orbits for LkCa 15 d at smaller stellocentric radii. 
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Extended Data Figure 7 | Orbital integration results. a, Stable orbits for 24.7 au, initial eccentricities of order 10~°, and relative inclinations of <1°. 
LkCa 15 b, c, and d over a 10 Myr integration. b, Osculating eccentricity. After a 10 Myr integration, the eccentricities of c and d have increased to 
The planets are each 0.5 My with initial semi-major axes of 12.7, 18.6, and only a few percent. 
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Extended Data Table 1 | Summary of observations 


Target Right Ascension Declination tint Nérames * Nyisits | Total Time Average Seeing 
(hh mmss.sss) (ddmmss.sss) (s) (h) (asec) 


2014 Nov 15: Ha and 642 nm continuum 


LkCa 15 04 39 17.796 +22 2103.48 30 316 1 2.63 0.56 
2014 Nov 22: Ha and 642 nm continuum 

LkCa 15 04 39 17.796 +22 2103.48 15 364 1 1.52 N/At 
2014 Dec 15: L’ 

LkCa 15 04 39 17.796 +22 2103.48 10 40 15 1.67 0.76 
HD284668 04 42 09.686 +221355.62 10 40 5 0.56 

HD284581 04 40 32.495 +22 3132.88 10 40 4 0.44 

GM Aur 04 55 10.983 +30 2159.54 10 40 5 0.56 

2015 Feb 5-7: Ks 

LkCa 15 04 39 17.796 +22 2103.48 20 20 19 2.11 0.95 
HD284668 04 42 09.686 +221355.62 20 20 iA 0.78 

HD284581 04 40 32.495 +22 3132.88 20 20 7 0.78 

GM Aur 04 55 10.983 +30 2159.54 20 20 6 0.67 


*Number of frames in each visit. 
tEach visit consists of all images taken before switching between target and calibrator. 
{The seeing monitor was unavailable during the 22 November observations. 
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Extended Data Table 2 | False planet injection results 


PA. * Xint Vint Xree8 — Vrecll «=AP.AS As# Peak SNR 
a) (pix) (pix) — (pix) ~— (pix) (°) (pix) 
77 135.2 127.0 136.0 127.3 -0.53 -0.85 8.9 
-44 132.1 132.4 132.8 132.8 wa -0.78 6.2 
38 117.7 133.2 1162 1343 -2.25 -1.80 4.3 
73 1140 127.7 112.3 127.8 -1.81 -1.66 4.6 
108 114.0 121.1 113.9 122.0 4.67 0.15 5.5 
143 W739 WS5/ Ws ‘TT16:7 2.50 0.91 4.7 
Simulated Planet Means 0.62 -0.67 5.7 
LkCa 15 b Fit Results 
x ¥ PA. s Peak SNR 

Parameters 135.8 121.8 -103.4 11.6 6.8 
1 o Errors +2.7 +1.0 +30% 


«Input false planet position angle. 
TInput false planet X location. 
tInput false planet Y location. 
§Recovered false planet X location. 
| | Recovered false planet Y location. 


qRecovered minus input false planet position angle. 
#Recovered minus input false planet separation. 
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Measurement of interaction between antiprotons 


The STAR Collaboration* 


One of the primary goals of nuclear physics is to understand the 
force between nucleons, which is a necessary step for understanding 
the structure of nuclei and how nuclei interact with each other. 
Rutherford discovered the atomic nucleus in 1911, and the large 
body of knowledge about the nuclear force that has since been 
acquired was derived from studies made on nucleons or nuclei. 
Although antinuclei up to antihelium-4 have been discovered! 
and their masses measured, little is known directly about the 
nuclear force between antinucleons. Here, we study antiproton 
pair correlations among data collected by the STAR experiment? 
at the Relativistic Heavy Ion Collider (RHIC)3, where gold ions 
are collided with a centre-of-mass energy of 200 gigaelectronvolts 
per nucleon pair. Antiprotons are abundantly produced in such 
collisions, thus making it feasible to study details of the antiproton- 
antiproton interaction. By applying a technique similar to Hanbury 
Brown and Twiss intensity interferometry’, we show that the force 
between two antiprotons is attractive. In addition, we report 
two key parameters that characterize the corresponding strong 
interaction: the scattering length and the effective range of the 
interaction. Our measured parameters are consistent within errors 
with the corresponding values for proton-proton interactions. Our 
results provide direct information on the interaction between two 
antiprotons, one of the simplest systems of antinucleons, and so 
are fundamental to understanding the structure of more-complex 
antinuclei and their properties. 

Although the theory of quantum chromodynamics (QCD) provides 
us with an understanding of the foundation of the nuclear force, this 
binding interaction in nuclei operates at low energy, where the force is 
strong and difficult to calculate directly from the theory (see ref. 5 and 
references therein for recent developments). For that reason, a com- 
mon parameterization of the effective interaction between nucleons is 
based on experimental measurements, and the corresponding param- 
eterization for antinucleons remains undetermined. The important 
paramfeters in such a description of the interaction are the scattering 
length (fo), which is related to elastic cross-sections, and the effective 
range of the interaction (do), which is determined to be a few femto- 
metres (the typical nuclear scale). For a short range potential, these two 
parameters are related to the s-wave scattering phase shift 5p and the 


collision momentum k by kcot(59) = r + =dyk’. The existence and 
0 


production rates of antinuclei offer indirect information about inter- 
actions between antinucleons, and also have relevance to the unex- 
plained baryon asymmetry in the Universe. Antinuclei produced to 
date include antiprotons, antideuterons, antitritons, antihelium-3, and 
the recently discovered antihypertriton and antihelium-4 (see ref. 1 
and references therein). The interaction between two antinucleons is 
the basic interaction that binds the antinucleons into antinuclei, and 
this has not been directly measured previously. Of equal importance, 
one aspect of the current measurement is a test of matter—antimatter 
symmetry, more formally known as CPT—a fundamental symmetry 
of physical laws under the simultaneous transformations of charge 
conjugation (C), parity transformation (P) and time reversal (T). 
Although various prior CPT tests’ have been many orders of 


magnitude more precise than what is reported here, there is value in 
independently verifying each distinct prediction of CPT symmetry’. 

Ultra-relativistic nuclear collisions produce an energy density sim- 
ilar to that of the Universe microseconds after the Big Bang, and the 
high energy density creates a favourable environment for antimatter 
production. The abundantly produced antiprotons provide the oppor- 
tunity to measure, for the first time, the parameters fy and dp of the 
strong nuclear force between antinucleons rather than nucleons. 

The technique used to probe the antiproton—antiproton interaction 
involves momentum correlations, and it resembles the space-time cor- 
relation technique used in HBT (Hanbury Brown and Twiss) intensity 
interferometry. Since its invention for use in astronomy in 
the 1950s‘, the HBT technique has been adopted in many areas of 
physics, including the study of the quantum state of Bose-Einstein 
condensates®, and the correlation among electrons’ and among atoms 
in cold Fermi gases’”. A Bose-Einstein enhancement in particle physics 
was first observed in the late 1950s as an enhanced number of pairs of 
identical pions produced with small opening angles, the GGLP 
(Goldhaber, Goldhaber, Lee and Pais) effect!!. Later on, Kopylov and 
Podgoretsky noted the common quantum statistics origin of the HBT 
and GGLP effects!, and, through a series of papers (see a review? and 
references therein), they devised the basics of the momentum correla- 
tion interferometry technique. In this technique, they introduced the 
correlation functions (CFs) as ratios of the momentum distributions 

F P(p,, : 
of correlated and uncorrelated particles, C(p, , P,) = ree with 
C=1 for no correlations, suggested the so-called mixing technique to 
construct the uncorrelated distribution by using particles from different 
collisions (events), and formulated a simple relation of the CFs with the 
space-time structure of the particle emission region. Here C(pi, pz) is 
the correlation function, P(p;) and P(p2) are probabilities for detecting 
a particle with momentum p, and a particle with momentum pp, 
respectively, and P(pj, p2) is the joint probability for detecting both 
simultaneously. As a result, the momentum correlation technique has 
been widely embraced by the nuclear physics community'*"!”, 

Figure 1 illustrates the process of constructing two-particle correla- 
tions in heavy-ion collisions. In addition to quantum statistics effects, 
final state interactions (FSIs) play an important role in the formation 
of correlations between particles. FSIs include, but are not limited to, 
the formation of resonances, the Coulomb repulsion effect, and the 
nuclear interactions between two particles!#151819, In fact, FSI effects 
provide valuable additional information. They allow for (see refs 16, 
20 and references therein) coalescence femtoscopy, correlation fem- 
toscopy with non-identical particles, including access to the relative 
space-time production asymmetries, and a measurement of the strong 
interaction between specific particles. The last measurement is often 
difficult to access by other means and is the focus of this paper (for 
recent studies see refs 21, 22). 

Ina semi-classical geometrical description, a complex heavy-ion col- 
lision can be regarded as a superposition of many individual nucleon- 
nucleon collisions, each governed by a constant probability of interaction 
with all nucleons travelling in straight lines. The centrality corresponds 
to the extent that two nuclei overlap, and events are categorized by their 


*Lists of participants and their affiliations appear at the end of the paper. 
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Figure 1 | A schematic of the two-particle correlation process in a 
heavy-ion collision. The display is overlaid on an event display from 

the Time Projection Chamber in the STAR detector. The curves show 
particle trajectories, from which the track momenta are determined. These 
trajectories are measured in three dimensions, but are projected onto a 
single plane in this illustration. The STAR detector measures three-vector 
momenta over a wide range beginning at about 0.1 GeV c_!. Two particles 
emitted from two separated points, with four-coordinates X, and X;, are 
detected with four-momenta p; and py». For the pair of indistinguishable 
particles with even/odd total spin, the two quantum mechanical amplitudes 
(representing, for non-interacting particles, products of plane waves (p;|X,) 
(p2|Xp) and (p2|X,q) (pi|Xp), where (p|X) = exp(—ipX)) must be added/ 
subtracted to yield the amplitude which is symmetric/antisymmetric 

with respect to the interchange of particle momenta. This results in 

an enhancement/suppression in the joint detection probability at zero 
momentum separation with the width inversely proportional to the space- 
time separation of particle emission points. 


centrality, based on the observed number of tracks emitted from each 
collision. Zero per cent centrality corresponds to exactly head-on colli- 
sions which produce the most tracks, while 100% centrality corresponds 
to barely glancing collisions which produce the fewest tracks. The data 
used here consists of Au+ Au collisions at a centre-of-mass energy of 
200 GeV per nucleon pair, taken during the operation of RHIC in the 
year 2011. In total, 500 million events were taken by the minimum-bias 
trigger at STAR. This trigger selects all particle-producing collisions, 
regardless of the extent of overlap of the incident nuclei, but with a 
requirement that collisions must have occurred along the trajectory of 
the colliding Au ion and within +30 cm of the centre of STAR’s Time 
Projection Chamber (TPC)*%. Events used in this analysis correspond 
to the 30%-80% centrality class, for which the signal due to two-particle 
interaction is stronger than that from smaller centrality classes, while 
particle yields are larger than that from larger centrality classes. 

The two main detectors used in the measurement are the STAR TPC 
and the Time of Flight Barrel (TOF)**. The TPC is situated in a sole- 
noidal magnetic field (0.5T), and it provides a three-dimensional 
image of the ionization trails left along the path of charged particles. 
The TOF encloses the curved surface of the cylindrical TPC. In con- 
junction with the momentum measured via the track curvature in 
TPC, particle identification (PID) is achieved by two key measure- 
ments: the mean energy loss per unit track length, (dE/dx), which can 
be used to distinguish particles with different masses or charges, and 
the time of flight of particles reaching the TOF detector, which can be 
used, together with tracking information, to derive the square of a 
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Figure 2 | m? versus n,_ for negatively charged particles. Here 

m? = (p’/c’)(#7c?/L? — 1), where t and L are the time of flight and path 
length, respectively. c is the light velocity. z=1In({ dE/dx) dE/dx) g) and 
(dE/dx) g is the expected value of (dE/dx) for (anti)protons. 0, is the rm.s 
width of the z distribution, andn, is the number of standard deviations 
from zero, the expected value of z for (anti)protons. The antiprotons, 
centred at m?=0.88 (GeV c *)? andn, =0, are well separated from other 
particle species. (Anti)protons satisfying 0.8 (GeV c~2)?< mass? <1 

(GeV c *)’ and|n, | < 1.5 are selected for making pairs. With this 
selection, the purity is >99% for (anti)protons with transverse momentum 
less than 2 GeV c_!. Colours denote particle population (counts) in cells 
formed by even division of m’ andn,. 


particle’s mass (m7). Figure 2 shows a typical calculated mass-squared 
(m?) distribution versus n, (see Fig. 2 legend) for antiprotons. 

The population distribution of (anti)proton pairs as a function of 
(anti)proton momentum (k’) in the pair rest frame (in which the centre 
of mass of the pair is at rest, convenient for carrying out measurements) 
is measured for the correlated pairs from within the same event, A(k’), 
and, separately, for the non-correlated pairs from two different (mixed) 
events, B(k’). The former corresponds to the joint probability P(py, p2), 
and the latter corresponds to the product of two probabilities, 
P(p\)P(p2), where P(p;) and P(p2) each corresponds for observing single 
(anti)protons. The ratio of the two, A(k’)/B(k’), gives the measured CF 
(see Methods). The observed (anti)protons can come from weak decays 
of already correlated primary particles, hence introducing residual cor- 
relations which contaminate the CF. The dominant contaminations to 
the CF come from the p-A (p—A) and A-A (A —A) correlations (where 
pand A denotes the proton and lambda particle, respectively, and p and 
A denotes the corresponding antiparticle), and are taken into account 
by fitting the CF with corresponding contributions. Taking the two-pro- 
ton correlation measurement as an example”, 


Crnainaiyeh") =1 +2 pp|Cop(k*s Rp) ~ 1]+xpalCpa(k"sRpa) a 1| 
+ 4|Caa(k) = 1] (1) 


where Cinclusive(k ) is the inclusive CE, and Culk s Rpp) is the true 
proton-proton CF, which can be described by the Lednicky and 
Lyuboshitz analytical model!’. In this model, for given s-wave scatter- 
ing parameters, the correlation function with FSI is calculated as the 
square of the properly symmetrized wavefunction averaged over the 
total pair spin and the distribution of relative distances of particle emis- 
sion points in the pair rest frame (see Methods). C are the residual CFs 
which are expressed through the p-A and A-A CFs, C " alk DAB R " ayand 
Cya(ki,)> using integral transformation” from k* jandks, tok*, (see 
Methods). C . A(kigR . ,)is taken from a theoretica calculation)’, which 
includes all final-state interactions and explains experimental data 
well?! C, ,(kx,) is from an experimental measurement corrected for 
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mis-identified As (ref. 22). Rp, and R,a, assumed to be the same numer- 
ically, are the invariant Gaussian radii”! from the proton—proton cor- 
relation and the proton—A correlation, respectively. Xpp» Xpa and x,a, 
taken from the THERMINATOR2 model”®, are the relative contribu- 
tions from pairs with both daughters from the primary collision, pairs 
with one daughter from the primary collision and the other one from 
a A decay, and pairs with both daughters from a A decay, respectively. 

Figure 3 shows the CF for proton-proton pairs (Fig. 3a) and 
antiproton-antiproton pairs (Fig. 3b), for the 30%-80% centrality 
class of Au+ Au collisions at a centre-of-mass energy of 200 GeV 
per nucleon pair. The proton-proton CF exhibits a maximum at 
k* 0.02 GeV cc! due to the attractive singlet s-wave interac- 
tion between the two detected protons and is consistent with pre- 
vious measurements’’. The antiproton-antiproton CF shows 
a similar structure with the maximum appearing at the same 
k* value. In Fig. 3c, the ratio of the inclusive CF for proton- 
proton pairs to that of antiproton-antiproton pairs is presented. 
It is well centred at unity for almost all the k° range, except for 
the region k° <0.02GeVc_!, where the error becomes large. 
This indicates that the strong interaction is indistinguishable within 
errors between proton-proton pairs and antiproton—antiproton pairs. 
By fitting the CF with equation (1), we determine the singlet s-wave 
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Figure 3 | Correlation functions and their ratio. a, b, Correlation 
functions for proton—proton pairs (a) and antiproton-antiproton pairs (b). 
The ratio of the former to the latter is shown in c. Errors are statistical only. 
The fits to the data with equation (1), Cincusive(k ), are plotted as solid lines, 
and the term 1 + Xppl Cop (k's Ryp) — 1] is shown as dashed lines. The x per 
number of degrees of freedom of the fit is 1.66 for a and 1.61 for b. 

To take advantage of the existing knowledge on the proton-proton 
interaction, which is relatively well understood, when fitting the 
proton-proton correlation, fy and do for protons are fixed at values 
measured from proton-proton elastic-scattering experiments, which 

are 7.82 fm and 2.78 fm, respectively”. When fitting the antiproton- 
antiproton correlation, fo and do are treated as free parameters. 
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Figure 4 | dy versus fo for (anti)nucleon-(anti)nucleon interactions. 

The singlet s-wave scattering length (fo) and the effective range (dp) for 

the antiproton-antiproton interaction (red star) is plotted together with 

the s-wave scattering parameters for other nucleon-nucleon interactions. 
Here, statistical errors are represented by error bars, while the horizontal 
uncertainty for fo is smaller than the symbol size, and systematic errors are 
represented by the box. Errors on other measurements”? are of the order of 
a few per cent, smaller than the symbol size. 


scattering length and effective range for the antiproton-antiproton 
interaction to be fo=7.41 £0.19(stat.) + 0.36(sys.) fm and dj =2.14+ 
0.27(stat.) + 1.34(sys.) fm, respectively. Here stat. and sys. indicate statis- 
tical and systematic errors, respectively. The extracted radii for protons 
(Rpp) and that for antiprotons ( R5p) are 2.75 +0.01(stat.) + 0.04(sys.) fm 
and 2.80 + 0.02(stat.) + 0.03(sys.) fm, respectively. 

Figure 4 presents the first measurement of the antiproton-antiproton 
interaction, together with prior measurements for nucleon-nucleon 
interactions. Within errors, the fy and do for the antiproton—antiproton 
interaction are consistent with their antiparticle counterparts—the ones 
for the proton-proton interaction. Our measurements provide para- 
meterization input for describing the interaction among cold-trapped 
gases of antimatter ions, as in an ultracold environment, where s-wave 
scattering dominates and effective-range theory shows that the scatter- 
ing length and effective range are parameters that suffice to describe 
elastic collisions. The result provides a quantitative verification of 
matter—antimatter symmetry in the important and ubiquitous con- 
text of the forces responsible for the binding of (anti)nuclei. Possible 
future improvement of the measurement could be made by reducing 
the uncertainty from the A—A CF (Cy, atk )), which dominates our 
systematic error, by further accumulation of data. In addition, a sim- 
ilar extraction of fy and dy could also be repeated with (anti)proton- 
(anti)proton CF*® measured at the Large Hadron Collider, where the 
yield ratio of antiproton to proton is close to unity. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Event mixing for non-correlated pairs and the correction for purity. Non- 
correlated pairs each consist of two daughter particles. These daughters belong 
to two events which are carefully chosen so that they have similar event multi- 
plicity and topology. The ratio A(k’)/B(k’) (see above), after being normalized at 
a large k* (at least 0.25 GeV c-}), gives the measured CF, C(k’) meas: Because in 
practice one cannot select 100% pure (anti)protons, a correction to pairs is applied 


to obtain the PID-purity-corrected CF: Cpuritycomected(K*) = we +1. 


For simplicity, in equation (1) the subscript “meas” is dropped, and elsewhere in 
this paper, the subscript “PurityCorrected” is dropped. 

The transformation from k;, and ki, tok;,. The residual CF CG, (k*) in equa- 
tion (1) is naturally expressed as an integral transformation of the parent 
CF C,,(kp,)- Here kp, (and k* = kp,) is the magnitude of the three-e momentum 
of either particle in the pair rest frame, while in this case for k*,, one of 
the protons is the decay daughter of A. This transformation is done by 
Coalkpp) = SCpalkpa) T(kpas kbp) Ake, where T( pA: Kpp) is a matrix that trans- 


Pl 
forms Kea to Kop (ref. 25). The transformation matrix is generated with the 


THERMINATOR2 model” which is a Monte Carlo event generator dedicated 
to studies of the statistical production of particles in relativistic heavy-ion colli- 
sions. The same procedure is also used in the transformation from kj, to k},. 
The calculation of the FSI contribution to the correlation function. The fem- 
toscopic correlations due to the Coulomb FSI between the emitted electron and 
the residual nucleus in beta decay have been well known for more than 80 years; 
they reveal themselves in a sensitivity of the Fermi function (an analogue of 
the CF*!) to the nuclear radius. Compared with non-interacting particles, the 
FSI effect in a two-particle system with total spin S manifests itself in the substi- 
tution of the product of plane waves, exp(—ipiX, — ip2Xp), by the non- 
symmetrized Bethe-Salpeter amplitudes ve (Xx, ,X,) = cae. , X;) (refs 
14, 19, 32, 33). For identical particles, the symmetrization requirement in the 
representation of total pair spin S takes on the same form for both bosons and 
fermions: the non-symmetrized amplitude should be substituted by 
[Ppp Xe X,)+(-08 Yop Oe ,X,)] /V2 - In the ia eco X,— Xy= 
{t,r}and p — p, = {w;' — w3 , 2k*} where w;' = (m; +k**)’* is the energy of 
a particle of mass m,, and f° and r’ are the relative emission time and relative 
separation in the pair rest frame, respectively. In this frame, the non-symmetrized 
Bethe-Salpeter amplitude at equal emission times (t° = 0) reduces, up to an ines- 
sential phase factor, to a stationary solution of the scattering problem, yr. 
At small relative momenta, k*<~1/r’, this solution can be used in practical cal- 
culations with the condition |t*| < mr (refs 19, 32). The equal-time approxi- 
mation is almost exact in beta decay, and it is usually quite accurate for particles 
produced in high-energy collisions (to a few per cent in the FSI contribution to 
CFs of particles even as light as pions”). In collisions involving heavy nuclei, the 
characteristic separation of the emission points, r’, can be considered substantially 
larger than the range of the strong-interaction potential. The FSI contribution is 
then independent of the actual potential form and can be calculated analytically 
with the help of corresponding scattering amplitudes only**. At small k’, it is 
basically determined by the s-wave scattering amplitudes f*(k") scaled by the 
separation r’ (ref. 19). 

The analytical calculation of the (anti) proton-(anti) proton correlation 
function. The (anti)proton—(anti)proton correlation function, Cop(k's Rpp) in 
equation (1), can be described by the Lednicky and Lyuboshitz analytical 
model’, In this model, the correlation function is calculated as the square of the 
properly symmetrized wavefunction averaged over the total pair spin S and 
the distribution of relative distances (r*) of particle emission points in the pair 
rest frame, assuming 1/4 of the singlet and 3/4 of triplet states and a simple 
Gaussian distribution dN /d’r* = exp(—r*?/ (4R;,,)) . Starting with the FSI 
weight of nucleons emitted with the separation r’ and detected with the relative 
momentum k’, 


w(k* 2%) = |yO(r*) + (1p y(n") P/2 
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where wOler) is the equal-time (f° =0) reduced Bethe-Salpeter amplitude which 
can be approximated by the outer solution of the scattering problem!”"». This is 


pO (rt) = el [A-(n) 


where 7 =(k'a,)~', ac-=57.5 fm is the Bohr radius for two protons, p=k'r’, 
&€=k'r' +p, A-(n) is the Coulomb penetration factor given by A.(y) =2n7 
[exp(2my)—1]~1, F is the confluent hypergeometric function, 


G(p,1) = A. (1) [Go(p, 7) + iFy(p, 7)] is a combination of the regular (Fo) and 
singular (Go) s-wave Coulomb functions, 


e**"E(—in, 1, if) ai (ey Seed 


—1 


1 1 2 
pt tok? 
0 c 


is the s-wave scattering amplitude renormalized by the Coulomb interaction, 
and h(q) = 17 52, (n(n? +y7)] —C— In |n| (here C= 0.5772 is the Euler 
constant). The dependence of the scattering parameters on the total pair spin S is 
omitted since only the singlet (S = 0) s-wave FSI contributes in the case of iden- 
tical nucleons. The theoretical CF at a given k’ can be calculated as the average 
FSI weight (w(k’, r°)) obtained from the separation r’, simulated according to the 
Gaussian law, and the angle between the vectors k* andr’, simulated according 
to a uniform cosine distribution. This CF is subject to the integral correction” 
—A.(n)|f (kK) P do/ (8./Rpp) due to the deviation of the outer solution from the 
true wavefunction in the inner potential region. In addition, in Au+ Au collisions 
the emitting source has a net positive charge, and it influences the CF differently 
for proton and antiproton pairs. This effect is included in the consideration 
according to refs 32, 33. 

Systematic uncertainties. The systematic uncertainties include variations due to 
track-wise and pair-wise cuts, the uncertainty in describing the C,, correlation 
function*, and the uncertainty from the Cj, measurement. The latter dominates 
the systematic error of dp and fo, and it affects dy more than it does fy because the 
shape of the CF is sensitive to do, in particular at low k’. As a consistency check, 
when fitting the proton-proton CF, both fo and dp are also allowed to vary freely, 
and the fitted fo and dp agree with the results from fitting the antiproton-antiproton 
CE Assuming the measurements from different systematic checks follow a uniform 
distribution, the final systematic error is given by (maximum — minimum) /./12. 
In our calculations, we consider the two-proton wavefunction, taking into account 
the Coulomb interaction between point-like protons in all orbital angular momen- 
tum waves and the strong interaction in the s-wave only. We neglect the small 
non-Coulomb electromagnetic contributions due to magnetic interactions, vacuum 
polarization, and the finite proton size*”*”**, This approximation changes the 
scattering parameters at the level of a few per cent””*”3*. The decomposition of 
systematics from our analysis can be found in Extended Data Table 1. 

Sample size. No statistical methods were used to predetermine sample size. 


(k*) = ik*A, (1) 
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Extended Data Table 1 | The decomposition of systematic errors 


Afo (£ im) Adg (4 fm) ARag (= fm) ARpp (= im) 


experimental cuts 0.14 0.33 0.01 0.03 
uncertainty of p-A CF 0.17 0.19 0.03 0.01 
uncertainty of A-A CF 0.36 1.34 0.03 0.03 
THERMINATOR2 model 0.07 0.09 <0.01 < 0.01 


The table presents systematic uncertainties for fo and do for antiproton—antiproton interaction, and R for both 
proton-proton and antiproton—antiproton interaction. Errors are listed separately by their sources. Assuming the 
measurements (fo, do and R) from different systematic checks follow a uniform distribution, the systematic error is 
given by (maximum measurement — minimum measurement)/J12. 
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Nanostructure surveys of macroscopic specimens 
by small-angle scattering tensor tomography 


Marianne Liebi!, Marios Georgiadis*, Andreas Menzel!, Philipp Schneider*, Joachim Kohlbrecher!, Oliver Bunk! & 


Manuel Guizar-Sicairos! 


The mechanical properties of many materials are based on the 
macroscopic arrangement and orientation of their nanostructure. 
This nanostructure can be ordered over a range of length scales. 
In biology, the principle of hierarchical ordering is often used to 
maximize functionality, such as strength and robustness of the 
material, while minimizing weight and energy cost. Methods for 
nanoscale imaging provide direct visual access to the ultrastructure 
(nanoscale structure that is too small to be imaged using light 
microscopy), but the field of view is limited and does not easily 
allow a full correlative study of changes in the ultrastructure over 
a macroscopic sample. Other methods of probing ultrastructure 
ordering, such as small-angle scattering of X-rays or neutrons, 
can be applied to macroscopic samples; however, these scattering 
methods remain constrained to two-dimensional specimens! * 
or to isotropically oriented ultrastructures*-’. These constraints 
limit the use of these methods for studying nanostructures with 
more complex orientation patterns, which are abundant in nature 
and materials science. Here, we introduce an imaging method that 
combines small-angle scattering with tensor tomography to probe 
nanoscale structures in three-dimensional macroscopic samples in 
a non-destructive way. We demonstrate the method by measuring 
the main orientation and the degree of orientation of nanoscale 
mineralized collagen fibrils in a human trabecula bone sample 
with a spatial resolution of 25 micrometres. Symmetries within 
the sample, such as the cylindrical symmetry commonly observed 
for mineralized collagen fibrils in bone®!°, allow for tractable 
sampling requirements and numerical efficiency. Small-angle 
scattering tensor tomography is applicable to both biological and 
materials science specimens, and may be useful for understanding 
and characterizing smart or bio-inspired materials. Moreover, 
because the method is non-destructive, it is appropriate for in situ 
measurements and allows, for example, the role of ultrastructure 
in the mechanical response of a biological tissue or manufactured 
material to be studied. 

The arrangement of the nanoscale and microscale building blocks 
of materials over extended regions has implications for the functional 
properties of many such materials. This is particularly the case for 
many biological structures, where hierarchically ordered organization 
ensures the required properties of the structure; for example, the high 
mechanical stability of bone is necessary for structural support of the 
body, force transmission for locomotion, and mechanical protection 
of vital organs'!"*. High-resolution imaging techniques exist that can 
directly image nanoscale structures'*"*. Aside from the individual 
importance of such information, an overview of how the nanoscale 
structure changes over macroscopic length scales is pivotal to under- 
standing the function of hierarchically organized tissues and mate- 
rials. In the case of human bone, the structural elements of interest 
are the mineralized collagen fibrils, and their orientation is one of the 
factors that determine bone strength at different structural levels!>~'’. 


Experimental techniques exist to probe such structural orientations, 
by using light, X-rays, and electrons, which use direction-dependent 
polarization®°, small- or wide-angle scattering!!§ or diffraction?!2, 
rather than directly imaging the nanostructures. These techniques 
cover a large field of view of thin sections or of the sample surface. 
However, studying 3D samples without the need for sectioning can be 
beneficial; cutting can influence the structures at the surface and, for 
some samples, cutting is not feasible or applicable at all. 

X-ray techniques are adept at probing the inside of macroscopic 
objects, owing to their high penetration depth, even in dense materials 
such as mineralized bone tissue. X-ray photons scatter where changes 
in electron density occur. The scattering angle, or momentum transfer 
q, is inversely related to the characteristic sizes of the structure under 
investigation. Thus, at high scattering angles, atomic length scales 
can be resolved (using wide-angle X-ray scattering, WAXS, or X-ray 
diffraction, XRD), whereas at small angles, scattering from structures at 
the nanometre scale can be detected (using small-angle X-ray scattering, 
SAXS). Ultrastructure with a high degree of orientation results in an 
anisotropic scattering pattern, which can be captured with a two- 
dimensional (2D) detector, and the orientation analysed in each q range 
separately®. Spatially resolved SAXS, where a specimen is scanned 
through an X-ray pencil beam, is a valuable tool for mapping ultrastruc- 
tural features in macroscopic samples!~>*. However, the samples must 
be thin, or at least of uniform structure in the direction of the incoming 
beam, to avoid losing spatial information as a result of averaging along 
the beam path. There have been efforts*-””** to combine SAXS or 
WAXS with computed tomography by rotating the sample with respect 
to the X-ray beam and scanning at different angular steps. In these cases, 
the reconstruction was carried out by integrating the scattered intensity 
over aq range of interest and applying standard computed-tomography 
reconstruction algorithms, which use filtered backprojection or alge- 
braic reconstruction techniques. This reconstruction procedure is 
applicable only if the scattering is invariant with respect to sample 
rotation, as is the case for isotropic scatterers or tissues and mate- 
rials with structural symmetry around the rotation axis. For more 
general, anisotropically oriented ultrastructures, a reconstruction of 
the full three-dimensional (3D) reciprocal-space map is needed to 
provide a representation of the distribution of the nanoscale structure. 
Thus, in each subvolume of the structure (a voxel), a tensor needs to 
be reconstructed, in contrast to a scalar value as is reconstructed with 
X-ray absorption tomography. Tensor tomography is applied in mag- 
netic resonance imaging, where the orientation-dependent magnetic 
relaxation is used to measure the diffusion of water in tissue (diffu- 
sion tensor imaging)”°. Directional dark-field imaging has been suc- 
cessfully combined with tensor tomography to investigate structures 
and orientations at the micrometre scale*®. We developed a new algo- 
rithm, which combines small-angle scattering with tensor tomography, 
allowing nanoscale structures to be probed and enabling a reconstruc- 
tion of the 3D reciprocal-space map for each q range individually. 
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Figure 1 | Schematic of the experimental set-up for SAS tensor 
tomography. The sample is scanned through an X-ray pencil beam 

(25 x 25m”) at different sample orientations while a 2D detector is 
recording the SAS pattern for each scanning point. In addition to rotation 
around the tomographic axis (y), different tilt angles around the x axis 

are measured. For the tomographic reconstruction, the sample volume 

is subdivided into small subvolumes (voxels), the size of which are 
defined by the width of the X-ray pencil beam. The detector frame probes 
ultrastructural length scales from a few nanometres up to a few hundred 
nanometres. This allows us, for instance, to probe mineralized collagen 
fibrils, as illustrated. The distinct Bragg reflection highlighted by an arrow 
originates from the regular arrangement of the approximately 65-nm 

gaps in the collagen fibrils, which contain mineral crystals. The scattering 
of the fan-shaped profile perpendicular to it originates from the shape, 
size and lateral arrangement of the mineral crystals, and is affected in the 
investigated range (indicated by white circles) by the lateral packing of the 
collagen fibrils’. 


Figure 1 shows a schematic of the experimental set-up and the principle 
of the method, which we refer to as small-angle scattering (SAS) tensor 
tomography. It allows tissues and materials with anisotropically ori- 
ented nanostructure to be studied at 3D spatial resolutions limited only 
by the size of the X-ray beam, which is typically at the micrometre scale. 

To demonstrate the method, we present a study of a trabecular bone 
specimen with a size of about 1 x 1 x 2.5mm? extracted from a human 
vertebra. The scattering angles covered by the detector correspond to 
structures in the range of 5.4 nm to 300 nm. This range includes the 
multi-scale structural characteristics of bone tissue, including the 
arrangement of the hydroxyapatite mineral crystal platelets with sizes 
of approximately 3 x 25 x 50nm%, their intra-fibrillar spacing (approx- 
imately 65 nm), and the diameters of collagen fibrils (50-200 nm) 
(refs 9, 12, 27). Most of the SAXS signal originates from the contrast in 
electron density between the mineral crystal platelets and the surround- 
ing lower-electron-density materials such as collagen fibrils and water’. 
It has been shown that the orientation of crystals, collagen fibrils, fibril 
bundles and fibres are closely related”*!2!7, On the basis of this result, 
any of these structures can be probed to estimate the orientation of 
mineralized collagen fibrils in bone. 

The sample is scanned through a pencil X-ray beam, which had 
dimensions of 25 x 25 um? for this experiment. In each scanning point, 
a 2D scattering pattern is recorded, which is a sum of the scattering of 
all voxels in the beam path. Similar to first-generation tomography, 
this scan is repeated for each orientation around the tomographic (y) 
axis. If the step size of the scanning and the angular spacing are cho- 
sen appropriately, the spatial resolution of the tomogram is limited 
only by the size of the X-ray beam. Accordingly, the scanning step size 
was 25 um and the angular spacing was 4.5° between 0° and 180°. In 
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contrast to the X-ray absorption probed in conventional tomography, 
the 2D scattering pattern that emerges from each probed voxel depends 
on the relative orientation of the underlying nanostructure with respect 
to the X-ray beam, and thus changes with sample orientation. Hence, 
for the reconstruction of the 3D reciprocal-space map in each voxel, 
additional data need to be collected, which required multiple tilt angles 
of the tomographic axis. Here, data were collected at six different tilt 
angles of the tomographic axis, resulting in a total of 240 measured pro- 
jections, each of which was measured by scanning 122 x 55 positions, 
resulting in 1.6 million scattering patterns with a total exposure time 
of 22.5h. For each scattering pattern, we reduced the data by azimuthal 
integration over 16 sectors and radial integration over the q range corre- 
sponding to distances from 85 nm to 165 nm, as indicated by the white 
circles in the detector frame in Fig. 1. 

For reconstruction of the orientation, we model the 3D 
reciprocal-space map with a series of spherical harmonics Y;” of degree 
land order m, which provide a complete basis for the intensity distri- 
bution on the sphere corresponding to a fixed q range. Spherical 
harmonics have been used in texture analysis to reconstruct orientation 
distribution functions on the basis of pole-figure analysis from XRD 
measurements”*”*. One advantage of using spherical harmonics is that 
known symmetries can be enforced by choosing the appropriate degree 
land order m. For the trabecular bone studied, we assume that the 
mineral platelet surface normal is randomly oriented in the plane per- 
pendicular to the fibrils, so we expect a point symmetry around q=0 
and a rotational symmetry around one axis in the 3D reciprocal-space 
map in the q range investigated. Therefore, we use here only the spher- 
ical harmonic functions with even degree / and zero order m. An opti- 
mization algorithm is used to minimize the error between the modelled 
intensity of voxels in the beam path projected to the detector plane and 
the measured intensity for each scanning point under all rotation 
angles. Figure 2 visualizes how all measured scattering patterns are used 
to reconstruct the reciprocal-space map from which the orientation of 
the nanostructure is determined. 

The main orientation in the 3D reciprocal-space map is defined 
by the relative orientation of the zenith of the spherical harmonic 
functions with respect to the coordinate system of the sample. Thus 
the main orientation in each voxel is given by polar and azimuthal 
spherical coordinate angles, which are optimization parameters of 
the reconstruction. The degree of orientation is calculated as the ratio 
between the anisotropic scattering (which is described by spherical har- 
monic functions with degree />0) and the total scattering. Assuming 
co-alignment between collagen fibrils and mineral crystals”*!*!”, the 
main orientation of the scattering in the investigated q range reveals the 


0.3 
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Figure 2 | Data processing. More than a million SAXS patterns (left) are 
used to reconstruct the 3D reciprocal-space map for each voxel, which is 
modelled using spherical harmonics (middle panel shows 4 x 3 x 3 voxels) 
and provides a representation of the nanoscale structure distribution. 
From the reconstruction, the main ultrastructure orientation as well as the 
degree of orientation is determined. The former is visually represented by 
the orientation of the cylinder (right); the latter is illustrated both by the 
colour (indicating the ratio of anisotropic scattering to total scattering; 

see colour scale) and the length of the cylinders. 
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Figure 3 | Orientation of collagen fibrils within a human trabecular 
bone sample. a, Computed-tomography reconstruction obtained 

from the transmitted intensity using standard filtered backprojection. 

b, Orientation of bone ultrastructure as determined using SAS tensor 
tomography (see also Supplementary Video 1). c, One tomographic slice 
of the reconstruction shown in b. The cylinder orientations represent 
the main orientation of collagen fibrils in the corresponding voxel as 


spatial arrangement of the mineralized collagen fibrils. The anisotropic 
scattering of bone is related to the fraction of aligned mineralized fibrils 
on average over a certain voxel. Hence, the higher the degree of ori- 
entation, the higher the ratio of aligned fibrils compared to randomly 
oriented fibrils!® (see Fig. 3). The degree of orientation is calculated 
directly from the coefficients of the spherical harmonic functions, 
which are additional optimization parameters of the reconstruction; 
see Methods for details. In the visualization of the reconstructed ultra- 
structure the degree of orientation is represented both by the colour 
and the length of the cylinders (Fig. 2, right). 

Simultaneous to the SAXS patterns, the X-ray absorption was 
measured with a photo diode mounted on the beam stop blocking the 
direct beam in front of the detector. The corresponding computed- 
tomographic reconstruction is shown in Fig. 3a. The result of SAS 
tensor tomography on the human trabecular bone sample is shown in 
Fig. 3b as a 3D visualization (see also Supplementary Video 1). One 
tomographic slice of the reconstruction is shown in Fig. 3c. Domains 
with a low degree of orientation (blue) and a high degree of orientation 
(red) are identified. 

The presence of domains with a size of several tens of micrometres 
and the observation that high degrees of orientation are found in places 
with higher curvature in the trabecula is in agreement with previous 
findings for trabecular bone’. As expected, in the curved regions, the 
collagen fibrils follow closely the trabecular bone microstructure. 

The technique presented here can be tuned to specific characteris- 
tics of the samples under investigation. For example, the range of sizes 
of the ultrastructure probed can be varied by changing the analysed 
scattering angles, that is, the q range. One advantage of the method is 
that the angular scattering range of interest is digitally extracted from 
the scattering frames such that different length scales can be probed 
simultaneously and reconstructed from the same data set. Independent 
of the q range, the real-space resolution of the spatial mapping can 
be adapted by the size of the X-ray beam and the scanning step size. 
The analysis algorithm would also allow the reconstruction of the full 
q range, similar to what has been shown for SAXS tomography for 
isotropic scatterers*®. This enables the size and form of the scatter- 
ing objects to be determined, as is done using standard SAS analysis. 


schematically depicted in d and e. The degree of orientation is represented 
by both the colour (indicating the ratio of anisotropic scattering to 

total scattering) and the length of the cylinders, where a low degree of 
orientation (blue) means a low degree of alignment of the collagen fibrils 
(d), and a high degree of orientation (red) means the collagen fibrils are 
well aligned with respect to each other (e). Scale bars in a and b 
correspond to 0.5mm. 


The method is not limited to the symmetries apparent in the example 
of trabecular bone. Spherical harmonics provide a complete basis for 
modelling the 3D reciprocal space, and azimuthal structure can be cap- 
tured by adding azimuthal orders m into the optimization. Yet, prior 
knowledge of symmetry of the scattering object can be used to reduce 
the complexity of the optimization problem and, potentially, the num- 
ber of projections needed to reconstruct the structure. Limits on the 
complexity of a 3D reciprocal-space map that can be recovered with the 
method presented here and rational sampling requirements remain to 
be explored. Nevertheless, the flexibility of resolution both in real and 
reciprocal space and the non-destructiveness of SAS tensor tomogra- 
phy make this imaging technique widely applicable in biological and 
materials science, providing information complementary to nanoscale 
imaging, which will support modelling and improve understanding of 
biological tissues and complex materials. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Sample preparation. A trabecula was extracted from a T12 human vertebra from 
a 73-year-old man and cleaned from soft tissue. The trabecula (approximately 
1 x 1 x 2.5mm} in size) was isolated using a scalpel and embedded into poly- 
methyl methacrylate (PMMA). The human vertebra had been obtained from the 
Department of Anatomy, Histology, and Embryology at the Innsbruck Medical 
University, Innsbruck, Austria, with the written consent of the donor according 
to Austrian law. All subsequent procedures were in accordance to Swiss law, the 
Guideline on Bio-Banking of the Swiss Academy of the Medical Sciences (2006) 
and the Swiss ordinance 814.912 (2012) on the contained use of organisms. 
Experiment. Experiments were carried out at the cSAXS beamline (X12SA) of 
the Swiss Light Source, Paul Scherrer Institut, Switzerland. The 12.4-keV X-ray 
beam was defined by a fixed-exit double-crystal Si(111) monochromator to an 
energy bandwidth of about 2 x 10~‘ and focused to a beam size of approximately 
25 x 25m”. Horizontal (sagittal) focusing was achieved by bending the second 
monochromator crystal and vertical (meridional) focusing by bending a Rh coated 
mirror. The sample was mounted on a goniometer on two perpendicular rotation 
stages on top of a two-axis scanning stage. A 7-m-long flight tube under vacuum 
was inserted between the sample and the detector to minimize air scattering and 
X-ray absorption. Scattering patterns were collected with a PILATUS 2M detector*?! 
placed 7.15-m downstream of the sample. Simultaneously, the transmitted beam 
was measured with a diode mounted on a beam stop blocking the direct beam 
before the exit window of the flight tube. 40 projections at tilt angles between 0° 
and 180° around the tomographic rotation axis were collected for 6 different tilt 
angles of the rotation axis between — 30° and 45°, with 6,710 scan points each. 
This resulted in a total of 1,610,400 scattering patterns and a total exposure time 
of 22.5h, with an individual frame exposure time of 50 ms. Each voxel was exposed 
for 12s in total. The dose d was estimated using d =uNoe/p (ref. 32), where 4/p 
is the mass-attenuation coefficient (37.5 cm” g! for bone at 12.4 keV; ref. 33), 
No is the number of incident photons per unit area, and ¢ is the photon energy. 
Thus the dose imparted on the bone specimen is about 2.9 x 10’ Gy. To assure that 
no structural changes were induced by radiation damage, the projection at zero 
rotation around both axes was repeated at the beginning, in the middle, and at 
the end of the experiment. In future experiments with increased resolution or on 
specimens prone to radiation damage, the dose could be reduced by using fewer 
tilt angles, lower photon counting statistics and a higher X-ray energy than in this 
proof-of-concept experiment. 

The total time for the experiment was 35.5 h, which includes an overhead of 13h 
due to motor movements. This will be substantially reduced by faster sample stages 
with precise positioning, which will also enable higher spatial resolution. Because 
the exposure time was limited by detector data transfer and not by photon flux, we 
expect that the availability of faster detectors soon will allow for greatly reduced 
acquisition times or an increased number of resolved voxels at existing facilities. 
Reconstruction. The SAXS data were reduced by azimuthal integration within 16 
segments. For further analysis we integrated over a q range of 0.0379-0.0758nm_}, 
which corresponds to length scales of 165-85 nm. Projections were aligned with 
subpixel accuracy on the basis of the absorption measurements on the beam stop 
diode using an efficient image registration approach based on cross-correlation*’. 
A gradient-based optimization algorithm was used to minimize the error metric 


e=2)0, (Ji a Ji ) , where J; is the measured intensity and I, is the modelled 


intensity from the ith scattering pattern. This error metric was chosen because 
minimizing it is a first-order approximation to a maximum-likelihood estimation 
for photon-counting Poisson noise**. Owing to the high number of parameters to 
optimize, the gradient of the error metric with respect to such parameters is 
calculated using an analytical expression rather than finite differences. 

The modelled intensity f is obtained by calculating the intensity pattern of 
each voxel for each orientation and summing them over the corresponding beam 
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path or ‘projection. To reduce artefacts due to projections in a rectangular 3D 
grid along arbitrary orientations, the volume was upsampled by a factor of two 
before the projection. For the projection, bilinear interpolation was used 
followed by a triangular 2D filter and by a final downsampling. The 
reciprocal-space intensity in each voxel was calculated using a series of spherical 
harmonic functions Y;” of degree / and order mas R = 0,0, 41 Y1"(8 6) es 
where 0 and ¢ are the polar and azimuthal angles, respectively, and a,” are coef- 
ficients of the spherical harmonics. To model the scattering from the collagen 
fibrils, we used spherical harmonic functions Y;" of degree 1=0, 2, 4, 6 and of 
order m=0. The square of the sum was used to avoid negative intensities. To 
guide the solution towards the a priori known rotational symmetry of the colla- 
gen fibrils, smooth penalty terms were introduced for the coefficients aj". For 
the gradient calculation, a backprojection operator is needed, which was imple- 
mented using bilinear interpolation. To accelerate convergence of the 
reconstruction, the gradient of the error metric with respect to the coefficients 
a;" was spatially convolved with a 3D Hamming window of 3 x 3 x 3 voxels, 
which can be understood as an application of Grenander’s method of sieves*”. 
The parameters of the optimization in each voxel were the polar angle @) and 
azimuthal angle ¢o, representing the orientation of the probed ultrastructure in 
3D as well as the coefficients aj” of the spherical harmonics, from which the 
degree of orientation o is calculated as 


[f(a +a4Y? +agve) sindd¢dé 
Jf (ave +ao¥) +a4Y2 Hagley sindd¢dé 


o= 


which corresponds to the ratio between the anisotropic scattering and the total 
scattering. To accelerate convergence, the optimization was performed following 
a three-step approach. First, the angles that define the spherical harmonics were 
optimized while the coefficients were kept fixed, which helps to initially optimize 
a subset of the parameters. This was followed by optimizing the coefficients of the 
spherical harmonics while the angles obtained in the first step were kept constant. 
Typically 20 iterations were performed for these first two steps. Finally, all param- 
eters were optimized simultaneously, with this last step reaching convergence 
after only two further iterations. Using a development prototype implementation, 
the reconstruction took three days running on a standard computer server with 
12 CPU cores and up to 24 threads process in parallel. The algorithm lends itself 
well to further parallelization, and an optimized code running on a computer 
cluster would speed up the processing substantially. 

Code availability. The computer code that was used for the reconstruction is cur- 
rently under further development, documentation and optimization and, hence, is 
currently not publicly accessible. Details of the earlier version of the code as used 
in the paper are available from the authors on request. 
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When used in combination with raster scanning, small-angle X-ray 
scattering (SAXS) has proven to be a valuable imaging technique 
of the nanoscale’, for example of bone, teeth and brain matter”>. 
Although two-dimensional projection imaging has been used to 
characterize various materials successfully, its three-dimensional 
extension, SAXS computed tomography, poses substantial 
challenges, which have yet to be overcome. Previous work*"! using 
SAXS computed tomography was unable to preserve oriented 
SAXS signals during reconstruction. Here we present a solution to 
this problem and obtain a complete SAXS computed tomography, 
which preserves oriented scattering information. By introducing 
virtual tomography axes, we take advantage of the two-dimensional 
SAXS information recorded on an area detector and use it to 
reconstruct the full three-dimensional scattering distribution in 
reciprocal space for each voxel of the three-dimensional object in 
real space. The presented method could be of interest for a combined 
six-dimensional real and reciprocal space characterization of 
mesoscopic materials with hierarchically structured features with 
length scales ranging from a few nanometres to a few millimetres— 
for example, biomaterials such as bone or teeth, or functional 
materials such as fuel-cell or battery components. 

By raster scanning an object through a focused X-ray beam and 
recording a scattering pattern at each point, which we refer to as a pro- 
jection in the following, scanning SAXS imaging combines the ability 
of SAXS to probe the nanoscale with spatial resolution. Changes in 
nanostructure over an area many orders of magnitude larger can be 
revealed using this technique'. Because a SAXS pattern is recorded 
at every scanned point on an area detector (Fig. 1b), it is possible to 
extract oriented scattering information. This information may be of 
great importance for the understanding of material properties, and has 
proven to be very useful for the characterization of highly anisotropic 
structures, such as collagen? ple. 

Owing to the penetrating nature of X-rays, scanning SAXS may 
be used to image thick samples. One drawback of pure projection 
imaging is that the signal is integrated along the beam direction, and 
consequently all depth information about the scattering structures 
is lost. It is possible to recover the exact location of a signal in three 
dimensions using computed tomography. In the case of SAXS, com- 
puted tomography has previously been demonstrated, for example, for 
polymer fibres, amorphous glass and myelin sheaths®'’. Compared 
with other tomographic techniques such as micro computed tomo- 
graphy or ptychographic tomography, which are either unable to go 
down to nanometre resolution or are limited to very small sample 
volumes'*!9, SAXS tomography is able to probe structural infor- 
mation at the nanoscale for objects of comparatively much larger 
dimensions. 

However, only a small fraction of the full two-dimensional (2D) 
SAXS signal has been able to be used for a correct reconstruction thus 
far, and, in particular, the orientation information contained in 2D 
SAXS patterns has not been able to be preserved during tomographic 


SAXS reconstruction. We demonstrate the full potential of SAXS 
tomography, by deriving a way to combine orientation-sensitive scan- 
ning SAXS imaging with computed tomography. Therefore we can 
obtain the full three-dimensional (3D) scattering distribution in each 
voxel of the 3D space (hence a six-dimensional (6D) combined real and 
reciprocal space map of the specimen). 

For most standard reconstruction techniques, a correct result is only 
possible if the signal is rotationally invariant, that is, the sum of all line 
integrals that form a projection image does not depend on the rotation 
of the sample. The most prominent example where this condition is 
satisfied is in the attenuation contrast of, for example, neutron or X-ray 
tomography as used in many industrial and clinical applications. In the 
case of SAXS tomography, rotational invariance is not generally present 
in the signal, except in special cases such as when imaging isotropic 
scattering powders, colloids or samples with certain symmetries. This 
lack of invariance restricts the number of applications, and excludes a 
wide range of materials with a preferred scattering orientation, includ- 
ing biomaterials composed of oriented collagen fibres and a wide range 
of functional materials with fibrous nanostructures. 

In addition to being observed in isotropic samples, rotational invari- 
ance also exists in oriented scattering samples if the momentum transfer 
vectors q, that is, the scattering orientations, are parallel to the rotation 
axis”. This implies that for an arbitrary sample, a correct reconstruction 
is guaranteed only for q vectors parallel to the tomography axis; all 
other scattering orientations cannot be reconstructed. Therefore, for a 
complete reconstruction of all possible q vectors in three dimensions, 
additional rotations of the sample are necessary. 

One solution to the problem of requiring additional rotations of the 
sample is to measure and reconstruct the sample from a large num- 
ber of different rotation axes, each parallel to q vectors in one specific 
direction. This would be highly inefficient and time consuming and 
thus infeasible in practice. Instead, we add the additional necessary 
rotations by introducing a tilt to the conventional rotation axis around 
the horizontal axis perpendicular to the beam direction (Fig. 1a). By 
doing so, projections are recorded from all possible sample orientations 
by combining the tomographic rotation y with a tilt by an angle 0. For 
each of the SAXS patterns of one projection, the scattering vectors qq 
recorded on the detector represent the integration of 2D slices of the 
3D q spaces of each voxel along the path of the X-ray beam. For the 
purpose of tomographic reconstruction, it is thus vital to know which 
q vectors are probed by each projection. This information is obtained 
by using a projection-dependent rotation matrix A; that describes the 
relationship between the laboratory and sample coordinate systems: 


q =A; "44 (1) 


If we define the direction of the X-ray beam to be along the z axis of 
the laboratory coordinate system, then each projection, indexed i, is 
specified in the sample coordinate system by its plane normal vector 
n,—A, '(0, 0, 1)". 
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a Detector 


Figure 1 | Experimental set-up and raster-scanning technique. 

a, Schematic representation of the experimental set-up. The sample is 
raster scanned through a focused X-ray beam, indicated by the vertical and 
horizontal arrows, and a diffraction pattern is recorded at each point. In 
addition to a rotation y around the tomography axis, the sample is rotated 
by 6 around an additional axis. b, Each pixel of a SAXS projection consists 
of an entire SAXS pattern rather than just a single value. c, Schematic 


Only q vectors parallel to the tomography axis can be reconstructed, 
and every SAXS projection probes a 2D slice of the 3D q space. 
Consequentially, all projections with orientation n; perpendicular 
to an axis ¢, given in sample coordinates, must necessarily contain q 
vectors parallel to tand so fulfil the rotational-invariance requirement 
for a rotation around t (see Methods for a more general discussion on 
rotational invariance). 

We now introduce virtual tomography axes. Instead of physically 
rotating the sample around an axis t, we select a subset of the full set of 
available projections that contains all those with q vectors parallel to the 
desired axis t. This technique is illustrated in Fig. 1c. Several projections 
are sketched in the sample frame of reference. For three select axes t 
within the sample, projections with q vectors parallel to t are selected, 
shown in Fig. 1c by arrows of matching colours. The orientation on 
the detector that contains the rotationally invariant SAXS data for each 
axis t and projection is indicated by the respective arrows orientations. 

Owing to a finite number of possible recorded projections, it is 
unlikely that a perfect match between t and the available projections 
will be found. We use the scalar product |n;- t| to quantify the mismatch 
between the projection n; and the virtual tomography axis ¢. For most 
materials of interest, the SAXS signal generally varies slowly and does 
not exhibit sharp azimuthal peaks (contrary to crystal diffraction, for 
example); consequently, projections for which 0 < |n;-t| < 1 may be 
included as well as those for which |n;-t|=0, to ensure a sufficient 
number of projections for the tomographic reconstruction. Our 
method is not suitable for materials such as nearly perfect single- 
crystalline objects, for which the SAXS signal does not vary slowly. 

Given such a subset of possible projections, for each individual pro- 
jection, we need to calculate the orientation of the recorded scattering 
vectors qq corresponding to q||t. The correct part of the detector is 
calculated by projecting t onto the plane specified by n;, which allows 
us to account for the small discrepancy introduced by considering pro- 
jections for which 0<|n;-t| <1: q,=t— (t-n,)n,. Using virtual 
tomography axes, many different axes f, and therefore q vectors, can be 
reconstructed by selecting the appropriate subsets and corresponding 
qa vectors. A major advantage of this method is that all the information 
from every SAXS pattern can be used for a complete reconstruction of 
the 3D q space in each voxel, rather than just a few qq vectors, as is the 
case when using the standard tomography axis. Removing the require- 
ment of having to directly measure a large number of tomography axes 
and instead reusing the same projections for many different virtual 
tomography axes allows us to efficiently perform a real and reciprocal 
space resolved 6D SAXS computed tomography characterization. 

To demonstrate the usefulness of the method, particularly for the 
characterization of biomaterials, we performed an experiment to 
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representation of the virtual-tomography-axis technique. Projections 
recorded at different views of the sample are used for various virtual 
tomography axes, represented by the red, green and blue arrows within 
the sample. Each of these axes is reconstructed independently by using 
appropriate projections and detector segments, as indicated by arrows of 
matching colour. 


resolve the nanostructures found in a human tooth. Teeth are made 
of a complex, highly hierarchically structured material that is able to 
withstand years of cyclic load in the harsh environment of the mouth. 
Knowledge of the exact structural details of this material at length 
scales spanning the nanoscale to the macroscale is still missing, and 
would help to understand the basic structure-function relationships 
that have been optimized during millions of years of evolution. This 
understanding is important for the design of tailored synthetic materi- 
als that mimic the durability and stiffness of teeth. An overview of four 
different tissues present in the investigated sample is shown in Fig. 2a. 
Mean mineral density information, observed along the long axis of the 
sample and obtained using high-resolution, state-of-the-art absorption 
tomography, reveals the layers of cementum and of mantle, primary 
and secondary dentine in the sample. All these tissues are composed 
of mineralized collagen fibres that give rise to a strong characteristic 
SAXS signal, owing to the periodic arrangement of mineral platelets 
along the fibre axis with approximately 67.6-nm spacing’. However, 
the details of the nano-architecture that is essential for the mechanical 
performance remains unresolved. 

Using the SAXS computed tomography technique described above, 
we reconstructed the full 3D scattering distribution in every voxel for 
the whole q range, rather than reconstructing just one value as in con- 
ventional tomography. From this data, we virtually extract information 
about what the reciprocal scattering space from each voxel looks like 
from any given orientation. Planar cuts along the coordinate-system 
axes of this voxel-wise 3D information are shown in Fig. 2b for three 
different regions within the sample. The top, blue-ringed slices of 
Fig. 2b corresponding to the pulp region of the tooth sample, which 
consists of secondary dentine, have a pronounced vertical fibre orien- 
tation, seen as clear collagen peaks in the q, direction. Different ori- 
entations and scattering signal strengths are seen in the green- and 
red-ringed slices, which correspond to regions of the tooth that are 
composed of primary and mantle dentine. 

Given the abundance of reconstructed data, a major difficulty is to 
extract the information of interest and compress the results into an 
easily understandable form. One possibility for determining collagen 
orientation is as follows. As well as diffuse background scattering that 
stems from the fibre bundles, collagen fibres display distinct peaks 
around |q| +0.9A~!. Previous studies”° have subtracted the back- 
ground scattering and isolated the collagen peak by fitting a power-law 
function with an exponent of about 2.6 (ref.). To extract the orienta- 
tion of the collagen peaks, we took the ratio of q-vector intensities in 
the range with (0.88-0.94 A-}) and without (0.82-0.86 A~!) collagen 
peaks. Assuming a constant exponent for the background scattering, 
the ratio of q-vector intensities deviates from a constant value only 
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Figure 2 | Virtual SAXS patterns extracted from the reconstructed 

3D volume. a, Average density within the sample obtained using state-of- 
the-art absorption micro computed tomography. Distinct dental tissues 
found in the sample are identified around the root canal (r): secondary 
dentine (s), primary dentine (p), mantle dentine (m) and cementum (c). 
b, 2D slices along the coordinate axes of the reconstructed 3D q= 

(4x Jy» Iz) space. The logarithmic scattering strength for three points in 


in those directions with additional scattering caused by the distinct 
periodicity of the collagen fibres. From the resulting 3D distribution of 
ratios, we extracted the mean fibre orientation by ellipsoid fitting and 
determining the largest principal axis. The resultant orientations for 
example slices inside the sample are shown in Fig. 3. The fibre orienta- 
tions are represented by small bars whose colour represents the average 
scattering intensity in the collagen range 0.88-0.94 A~!. Their length 
is scaled with the respective absorption values, so that equally long 
and vanishingly small bars are obtained inside and outside the sample 
volume, respectively. A strong change in collagen fibre orientation near 
the root-canal region (upper side of Fig. 3) of the sample contrasts the 
gradual increase in scattering strength towards the outer sides of the 
root (lower side of Fig. 3). The collagen fibre orientation is just one 
example of many possible parameters that can be extracted from the 
6D data. Various other parameters have been extracted from SAXS 
data previously, such as the local degree of alignment of the mineral 
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Figure 3 | 3D visualization of collagen fibre orientation within the 
tooth sample. The orientation of the coloured bars indicate the mean 
orientation of collagen fibres obtained from ellipsoid fits to the ratios of 
scattering intensities of q ranges with and without collagen peaks. The 
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the centre of the sample, indicated by the blue, green and red squares, is 
shown (colour coded by the outer ring on the plots) for slices in the q.-q,, 
4z-4x and q,—qy planes. Distinct collagen peaks can be seen at |q|~0.9 A, 
except for slices roughly perpendicular to the fibre orientation. The 
orientation of the fibres and the intensity of the peaks change noticeably in 
the different regions of the sample. An animation showing multiple slices 
is provided in Supplementary Video 1. 


platelets contained within the collagen matrix, or their shape and size 
distribution”’. 

In conclusion, we demonstrated a combined 6D real and reciprocal 
space SAXS computed tomography technique as a method to simul- 
taneously obtain information about the structural features of a sample 
from the nanometre to the millimetre scale. Using virtual rotation axes, 
we take advantage of all data recorded and are able to reconstruct the 
full 3D scattering distribution in each voxel of the 3D real space of 
the sample. This vast amount of additional information in each voxel 
could enable analysis of the 3D nanostructure of materials, spatially 
resolved in 3D on a large scale. As with all imaging techniques, analysis 
of the resultant data is a big part of the work. Given the amount of data 
reconstructed by the presented method, there is a need for data anal- 
ysis tailored to a specific problem. For the purpose of the investigated 
collagen sample, we proposed one possible way to extract fibre orienta- 
tions in three dimensions. Although our approach is a substantial step 


colour represents the average scattered intensity in the collagen range 
0.88-0.94 A~!. The underlying 3D nanomorphology within the entire 
sample is revealed. An animation showing all slices in one direction is 
provided in Supplementary Video 2. 
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forward, measurement and reconstruction times were considerable. 
(A discussion of the prospects of advances in hardware and the subse- 
quent reduction in time needed for an experiment is given in Methods.) 
Nonetheless, we believe that a combined 6D real and reciprocal space 
characterization is of great interest for nanostructure characterization 
of mesoscopic materials and composites with hierarchically structured 
features ranging from the few-nanometre to the few-millimetre length 
scale. Possible applications of such a characterization include the study 
of natural bones and teeth, or man-made functional materials. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Sample. A human upper lateral incisor was obtained from a pool of teeth col- 
lected from patients undergoing routine dental treatment unrelated to this study. 
The tooth was obtained with written informed consent, following the directives 
of the Ethical Review Committee of the Charité - Universitatsmedizin Berlin 
(EA4/002/09). The tooth had no caries and was extracted for periodontal reasons 
from an anonymous donor. For the SAXS experiment, a cylindrical segment of 
the tooth about 3 mm in diameter and 4mm in length was extracted using water- 
cooled dental drills. The harvested piece of the root was extracted from a region 
beneath the tooth crown, facing the lip (buccal side of the tooth). 

Experiment. The measurements were conducted at the X12SA (cSAXS) beamline 
of the Swiss Light Source (Paul-Scherrer Institute) with monochromatic 18.6-keV 
X-rays. A pencil beam of 50 jum x 50 \1m at the sample position was used to raster 
scan the specimen with a matching step size of 50 |1m and exposure time of 50 ms 
per point. In total, scanning SAXS projections were recorded from 288 unique 
sample orientations. The 288 projections were recorded for 10 different tilt angles 
0=0°, —4°, —12°, —20°, —28°, —36°, —44°, —52°, —60°, —68° of the tomography 
axis. A varying number (55, 29, 29, 29, 29, 29, 29, 25, 19, 15) of tomography angles 
y, evenly distributed over 360°, was used for each of the tilted tomography scans. 
The 6 = —12°, —28°, —44° and —60° tomographic measurements included a 
offset of 6.2° so a more even distribution of the projections could be achieved. Each 
projection consisted of 59 x 81 SAXS patterns, in the horizontal and vertical direc- 
tions, respectively, whereby the horizontal axis was scanned in continuous mode. 
In total, 1,376,352 SAXS patterns were recorded in slightly less than 40h. The 
scattering signal was collected using a photon-counting PILATUS 2M detector”? 
placed 7,363 mm downstream of the sample. Simultaneously, X-ray attenuation 
data was acquired using a diode directly mounted on the beamstop, which was 
used for a necessary normalization of the scattering signal!” and registration of the 
projections. The normalization was done by dividing each recorded SAXS pattern 
at point p (including all variables describing sample position and rotation) by the 
relative absorption of the sample I“°**(p) /1¢°¢¢; 
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Reconstruction. Azimuthal integration was performed using pyFAI*’ for all 
recorded SAXS patterns, regrouping the data into 360 azimuthal segments with an 
extent of 1° each. Following the integration, the data were reduced to an azimuthal 
range of 180° by averaging, making use of the fact that the data are symmetric with 
respect to the centre of the diffraction pattern. 

The selection of projections for each virtual tomography axis included an 
allowed deviation of the scalar product |n;- t|< 0.05 throughout the reconstruction. 
For our 288 recorded projections, this condition ensures that a sufficient number 
of points are picked, without increasing the introduced error more than necessary. 

A virtual tomography axis is expressed in sample coordinates by 
t = A(O,~) ‘(0,0, 1)". The reconstructed virtual tomography axes were chosen 
for 30 different values of 6 from 0° to 90° and a varying number N,= 
60sin(@7/180) + 1 of values of y from 0° to 360°, depending on the current value 
of #. This choice of N,, ensured that the virtual tomography axes were chosen to 
cover a solid angle of 21, allowing for a complete reconstruction of all possible 
orientations. 

In total, 1,168 evenly distributed virtual tomography axes in 13 different q- 
vector intervals ranging from |q|=0.45 A“! to |q|=1.16 A”! were reconstructed; 
15,184 individual reconstructions were performed. Each of these consists of a 
unique set of projections and azimuthal segments. Because only a very small azi- 
muthal part of the detector is used from each SAXS pattern, the radial width of 
the intervals was chosen to integrate over several pixels to increase photon statis- 
tics. The closest azimuthal segment to the projected virtual tomography axis was 
taken, that is, an azimuthal error of less than 0.5°. Each of the reconstructions 
was performed using a SART (simultaneous algebraic reconstruction technique) 
algorithm with total variation regularization, because this reconstruction technique 
has been shown” to perform well with strong undersampling and missing angular 
wedges, as in the present case. The computing time per reconstruction was about 
40s on a computer built for GPU computing (2 x Intel Xeon E5-2643, 4 x Nvidia 
Tesla Kepler K10, 256 GB RAM). Therefore, the reconstruction time for all recon- 
structions amounts to slightly over one week. In principle, all reconstructions 
can be run in parallel because they are completely independent from each other. 
An optimized, parallel reconstruction would thus allow for a substantial reduction 
in computing time—potentially to a few hours. 

To verify a correct reconstruction result, intensity maps of the reconstructed and 
reprojected scattering vectors alongside their measured counterparts are shown in 
Extended Data Fig. 1 for the |q|=0.88-0.94 A“! interval at two select azimuthal 
scattering orientations of one projection. 
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The 3D visualization of the reconstruction results was done using ParaView 
(http://www.paraview.org). 
Absorption computed tomography. The high-resolution micro computed 
tomography of the sample was acquired with a VersaXRM-500 X-ray microscope, 
Xradia (Pleasanton), operated at 50 kVp, 4 W and 30s exposure time. A voxel size 
of 3.087 |1m was achieved. 
Coordinate system. To describe the necessary rotations it is important to define 
coordinate systems, which was done according Extended Data Fig. 2. A fixed lab- 
oratory coordinate system is given by Xjabs Viab» Zab» with Zj,p in the beam direction, 
Yiab pointing upwards and x;,) completing the right-handed coordinate system. 
During the acquisition of each projection, the sample is raster scanned along 
the xiap and yjap axes for a total of n x m SAXS patterns per projection. For the 
reconstruction, a sample coordinate system is defined by its axes x, y, z, which are 
equivalent to the laboratory coordinate axes prior to any rotation. The rotations 
required for 3D SAXS computed tomography are described using Euler angles ~), 
6 and y, which follow a YZY convention. For our purpose, it is convenient to set 
w= 90° so that y and 6 represent the tomographic rotation and its tilt around x,p. 
For all sample rotations, the corresponding rotation matrix A; is calculated as a 
combination of three successive rotations: 


Aj = AyAgA, 
cosy) 0 siny|{cos@ —sin? 0}{ cosp 0 siny 
=| 0 1 0 |/sin@ cos? Oj} 0 1 0 
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The scattering vectors q, = (d,, d,, 0) recorded on the detector are described using 
detector coordinates d, and d,. The relationship between qq and q for each projec- 
tion is given according to equation (1) asq= A; 'q i 

Rotational invariance. Following ref. 25, a rotational invariance check is per- 
formed for a standard, vertical tomography axis as follows. For any horizontal line 


Yiab Of a projection P, a value Ip, (des dy) = 6 hy, (drs dy, Xjqp) is assigned 


to each pixel of the detector as the sum of all pixel values for this specific line scan. 
This function necessarily must be constant under rotation if rotational invariance 
is given—the information changes its horizontal position, but is not lost. Pixel-wise 
rotational invariance is then calculated as the ratio of the standard deviation to the 
mean of Ip ight d,) over all projections P: 


(py, (des d,)) 
Ipy, (de dy) 


A result for the vertical tomography axis of our measurement is shown in 
Extended Data Fig. 3. The pixel-wise rotational invariance is shown for two dif- 
ferent slices of the sample, marked by the white lines. Because a rotation is only 
performed around the vertical axis, these slices are independent from each other. 
Whereas the upper slice (Extended Data Fig. 3b) displays certain rotational invar- 
iance for pixels not on the vertical axis, owing to fibre symmetry, the lower slice 
(Extended Data Fig. 3c) clearly shows that rotational invariance is given only in 
the vertical orientation. 

We want to extend the rotational-invariance check to more than one rotation 
axis, but a line-wise treatment has proven to be insufficient. Because of additional 
rotations, different slices of the sample are no longer independent from each other, 
and the sample needs to be treated as a whole. Rather than summing over one 
scanning line, we have to sum over all scattering patterns obtained for one projec- 

a nm 
tion: Ip(d,, dy) = mii 
virtual rotation axis t, projections from a tilted rotation axis are included in the 
reconstruction. From this it follows that the rotationally invariant data are no 
longer restricted to the vertical direction, but are found in different azimuthal 
segments on the detector for each projection P. Thus, the scattering patterns need 
to be rotated accordingly for a proper comparison. This rotation is most easily done 
with the already azimuthally regrouped data Ip(, r), with x the azimuthal and 
r the radial coordinate. A rotation of the data then simplifies to a projection- 
dependent offset of y, and rotational invariance can be checked for any arbitrary 
set of projections and virtual tomography axis t. Extended Data Figure 4 
shows rotational-invariance results for two different virtual tomography 
axes—t = (0, 1 /./2, 1/./2) (Extended Data Fig. 4a) and t= (1/./2,1/./2, 0) 
(Extended Data Fig. 4b)—for already azimuthally regrouped data. For each pattern, 
all data has been shifted so that y =0° or y = 180° corresponds to the detector 


Ip(d,» dy, Xjab> Y;,,)- Additionally, for an arbitrary 
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orientation parallel to the chosen virtual rotation axis. Rotational invariance is 
achieved for any given virtual tomography axis by using data only from projections 
and orientations with scattering parallel to this axis. 

Discussion on acquisition times and resolution. The number of SAXS patterns 
that can be collected in a given time frame is mostly determined by two key para- 
meters of the X-ray beam used in the experiment—the X-ray energy and flux of the 
beam. To reduce the exposure time needed per single SAXS pattern, it is necessary 
to have as many photons contribute to the SAXS signal as possible. We measured 
the tooth sample presented in this work at 18.6 keV, the highest available photon 
energy at the cSAXS beamline. At this energy, a large proportion of the X-rays 
were absorbed in our sample and therefore did not contribute to the SAXS signal. 
Even a small increase in photon energy would have a very noticeable effect on the 
transmission, and ultimately the exposure time required for each point. The second 
major factor for reduction of the scanning time needed is the flux available for the 
experiment. Even though synchrotron facilities today are already very powerful, 
their technology is continuously being improved. An example is the upgrade being 
installed at the European Synchrotron Radiation Facility (ESRF). Planned to be 
finished by 2018, this upgrade is expected to increase the available photon flux by 
up to two orders of magnitude. These ongoing developments will enable very rapid 
data acquisition to be performed in the future, facilitated by recently developed 
fast-framing pixel detectors with frame rates of several kilohertz”®. 

A point of concern arising from much stronger sources is the issue of radiation 
damage. During our experiment we performed several control-scans throughout 
the measurement and confirmed that there was no change in the signal even after 
several hours of beam exposure. It is hard to predict at what point radiation damage 
inflicted by the beam starts to become an issue. There are, however, possibilities to 
limit these effects, such as active cooling with cryo-jets. 

The achievable real-space resolution in SAXS computed tomography is dictated 
mainly by the investigated sample and number of SAXS patterns (points) available. 
The total number of projections strongly depends on the number of individual 
scanning points per projection. From a tomographic point of view, there is little 
benefit from measuring only a few projections with a large number of points each 
or, similarly, a large number of projections with only a few points each. Therefore, 
the number of points per projection cannot be scaled up easily without a mas- 
sive increase in the acquisition time required. As a consequence, the real-space 
resolution is strongly limited by the size of the sample. For reference, the real-space 


resolution of our measurement for a 4-mm object was about 100,1m, because of 
the continuous acquisition used. 

The resolution in reciprocal space has to be discussed for two different cases, 
namely radial and angular resolution in reciprocal space. The radial resolution 
is mostly limited by the angular divergence of the beam and the pixel size of the 
detector. We chose to radially integrate over several detector pixels, which limits 
our radial reciprocal-space resolution to 0.05 A~!. In contrast to this, the angular 
resolution in reciprocal space is largely limited by the number of projections 
recorded. This number determines the deviation allowed when selecting pro- 
jections for each virtual tomography axis. In our case, a deviation of the scalar 
product |n;-t| <0.05 corresponds to a minimal angular resolution of slightly less 
than 6°. 

Considering all these points, the greatest potential to improve the performance 
of our method lies in the use of stronger sources and higher photon energy. Our 
measurement took 40h. An increase in usable flux by a factor of ten would decrease 
the time needed for exactly the same measurement by approximately the same 
factor—to slightly less than 4h. This reduction would enable SAXS computed 
tomography to be used for a case study of several samples. A substantial reduc- 
tion in acquisition time can also be achieved by scanning projections with lower 
real-space and angular reciprocal-space resolution, owing to the relationships 
mentioned previously. 

Code availability. The code used for azimuthal regrouping is openly available 
at https://github.com/kif/pyFAI. Additional code used is available upon request. 
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Extended Data Figure 1 | Comparison of measured and reconstructed 
values for one projection. Left, reprojected q data for two different 
orientations (90° and 152°) is compared to the measured data. Right, 
azimuthal values for both the reconstructed and measured data are given 
for three select points, indicated by the dashed lines in the left panel. For 
the chosen |q| range, distinct collagen peaks are reconstructed correctly 
for points 1, 2 and 3 at around 15°, 0° and 90°, respectively. A good 
agreement between reconstruction and measurement is seen. Animations 
of this and further projections showing all q orientations are provided in 
Supplementary Videos 3-5. 
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Sample coordinate system 


Xiab lab 
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Extended Data Figure 2 | Coordinate system used for the experiment. 
The sample orientation is described using three Euler angles 0, y and w. 
With y= 90°, 6 represents a tilt of the tomography axis around x,,, and 

vy describes a rotation around this tilted axis. The sample is scanned along 
Xab and yiab, With a diffraction pattern collected at each point. The detector 
coordinates are given by d, and d). 
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Extended Data Figure 3 | Rotational invariance for a standard SAXS fibres in the top part of the sample, shown in b, are mainly vertical. Owing 
computed tomography with a vertical tomography axis. a, Absorption to this symmetry, rotational invariance is also present for pixels not on the 
image acquired from the diode data with two vertical slices marked. vertical axis. Without this symmetry, rotational invariance exists only for 
b, c, Rotational invariance as defined in the text for both slices. In both the vertical direction, as seen in c. The white bars in the lower half of the 


cases, rotational invariance is present for all pixels that correspond to images are areas between the individual detector modules. 
scattering orientations parallel to the vertical rotation axis. The collagen 
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Extended Data Figure 4 | Generalized rotational invariance. 

a, b, Rotational invariance shown for different virtual tomography axes: 

t = (0,1/J/2, 1//2) (a) andt = (1/./2, 1/2, 0) (b). Radially 
integrated data are used. Compared to the standard case, shown in 
Extended Data Fig. 3, a line-wise integration is not possible in the general 
case. Instead, the standard deviation and mean are calculated from the 
projection-wise sum of all integrated SAXS patterns. A shift of the 
azimuthal angle so that the scattering orientation parallel to t is at 0° was 
applied. As can be seen, rotational invariance is also achieved in the 
general case for scattering orientations parallel to f. 
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Methane storage in flexible metal-organic 
frameworks with intrinsic thermal management 
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As a cleaner, cheaper, and more globally evenly distributed fuel, 
natural gas has considerable environmental, economic, and 
political advantages over petroleum as a source of energy for the 
transportation sector!”. Despite these benefits, its low volumetric 
energy density at ambient temperature and pressure presents 
substantial challenges, particularly for light-duty vehicles with 
little space available for on-board fuel storage®. Adsorbed natural 
gas systems have the potential to store high densities of methane 
(CH, the principal component of natural gas) within a porous 
material at ambient temperature and moderate pressures’. 
Although activated carbons, zeolites, and metal-organic 
frameworks have been investigated extensively for CH, storage® °, 
there are practical challenges involved in designing systems 
with high capacities and in managing the thermal fluctuations 
associated with adsorbing and desorbing gas from the adsorbent. 
Here, we use a reversible phase transition in a metal-organic 
framework to maximize the deliverable capacity of CH, while 
also providing internal heat management during adsorption and 
desorption. In particular, the flexible compounds Fe(bdp) and 
Co(bdp) (bdp?~ = 1,4-benzenedipyrazolate) are shown to undergo 
a structural phase transition in response to specific CH, pressures, 
resulting in adsorption and desorption isotherms that feature a 
sharp ‘step. Such behaviour enables greater storage capacities than 
have been achieved for classical adsorbents”, while also reducing 
the amount of heat released during adsorption and the impact of 
cooling during desorption. The pressure and energy associated with 
the phase transition can be tuned either chemically or by application 
of mechanical pressure. 

The driving range of an adsorbed natural gas (ANG) vehicle is 
determined primarily by the volumetric usable CH, capacity of the 
adsorbent, which is defined as the difference between the amount of 
CH, adsorbed at the target storage pressure (generally 35-65 bar) and 
the amount that is still adsorbed at the lowest desorption pressure 
(generally 5.8 bar)". With few exceptions'', adsorbents that 
have been investigated in the context of natural gas storage exhibit 
classical Langmuir-type adsorption isotherms, where the amount 
of CH, adsorbed increases continuously, but at a decreasing rate, as 
the pressure is raised (Fig. la). Consequently, it has proved difficult 
to develop adsorbents with the higher usable capacities needed for a 
commercially viable ANG storage system”. In pursuit of a new strategy 
for boosting usable capacity, we endeavoured to design an adsorbent 
with an ‘S-shaped’ or ‘stepped’ CH, adsorption isotherm, where the 
amount of CH, adsorbed would be small at low pressures but rise 
sharply just before the pressure reaches the desired storage pressure 
(Fig. 1b). Stepped isotherms have been observed for many flexible 
metal-organic frameworks that exhibit ‘gate-opening’ behaviour, 


whereby a non-porous structure expands to a porous structure 
after a certain threshold gas pressure is reached, but none of these 
materials have exhibited characteristics beneficial for CH4 storage 
applications!*~!®. If, however, a responsive adsorbent could be designed 
to expand to store a high density of CH, at 35-65 bar, and to collapse 
to push out all adsorbed CH, at a pressure near 5.8 bar, then it should 
be possible to reach higher usable capacities than have been realized 
for classical adsorbents. 

The metal-organic framework Co(bdp) was selected as a potential 
responsive adsorbent for methane storage, owing to its large internal sur- 
face area and its previously demonstrated high degree of flexibility!”. In 
its solvated form, this framework features one-dimensional chains of tet- 
rahedral Co”* cations bridged by u7-pyrazolates to form a structure with 
square channels with edge lengths of 13 A. The N2 adsorption isotherm 
of the evacuated framework at 77 K exhibits five distinct steps, which 
have been attributed to four structural transitions as the framework 
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Figure 1 | High-pressure CH, adsorption isotherms. a, b, The usable 
capacity is compared for an idealized adsorbent exhibiting a classical 
Langmuir-type adsorption isotherm (a) and an ‘S-shaped or ‘stepped’ 
adsorption isotherm (b), with the minimum desorption pressure Paes and 
the maximum adsorption pressure P,qs indicated by the vertical dashed grey 
lines. c, d, Total CH, adsorption isotherms for Co(bdp) (c) and Fe(bdp) (d) 
at 25°C. Here Pages = 5.8 bar and Pags = 35 bar are indicated by dashed grey 
lines. Filled circles represent adsorption; open circles represent desorption. 
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Figure 2 | Powder X-ray diffraction and solid-state structures. 

a, b, Powder X-ray diffraction patterns (26 is the diffraction angle) are 
shown for Co(bdp) (a) and Fe(bdp) (b) at 25°C and variable CH, pressures 
(as indicated), with X-ray wavelengths of 0.75009 A and 0.72768 A, 
respectively. For Co(bdp), the blue and green patterns correspond to the 
collapsed and expanded phases, respectively, with teal indicating patterns in 
which both phases are present during the transition between collapsed and 
expanded. For Fe(bdp), the blue and red patterns correspond to the 
collapsed and 40-bar expanded phases, respectively, with purple indicating 
patterns in which both phases are present during the transition from 


progresses from a collapsed phase with minimal porosity to a maximally 
expanded phase with a Langmuir surface area of 2,911 m* g~! (ref. 18). 

To investigate the ANG storage potential of Co(bdp), a high-pressure 
CH, adsorption isotherm was measured at 25°C (Fig. 1c). There is 
minimal CH, uptake at low pressures and a sharp step in the adsorp- 
tion isotherm at 16 bar. Although there is hysteresis in the desorption 
isotherm, the hysteresis loop is closed by 7 bar, such that there is less 
than 0.2 mmol g~! of CH, adsorbed at pressures below 5.8 bar. The step 
in the CH, isotherm is fully reproducible over at least 100 adsorption- 
desorption cycles (Extended Data Fig. 1), and can be attributed to a 
reversible structural phase transition between a collapsed, non-porous 
framework and an expanded, porous framework at transition pressures 
that are ideal for ANG storage. 

To determine the specific structural changes responsible for the 
stepped CH, adsorption isotherm of Co(bdp), in situ powder X-ray 
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Expanded Fe(bdp) 


collapsed to 40-bar expanded; orange patterns correspond to the 50-bar 
expanded phase. c, d, The bridging ligand precursor Hzbdp (c) along with 
the crystal structures (d) of the collapsed (0 bar, ‘low Puy, ) and CHy- 
expanded (30 bar, ‘high Ry, ’) phases of Co(bdp). e, Each benzene ring in the 
collapsed phase of Co(bdp) has four edge-to-face 1-1 interactions with 
neighbouring benzene rings. f, Crystal structure of the CH,-expanded 

(40 bar) phase of Fe(bdp). In d-f, Purple, orange, grey, blue, and white 
spheres represent Co, Fe, C, N, and H atoms, respectively; some H atoms are 
omitted for clarity. 


diffraction experiments were performed under various pressures of 
CH, at 25°C. Under vacuum, only one crystalline phase is observed 
in the diffraction pattern, consistent with the complete conversion of 
Co(bdp) to a collapsed phase upon desolvation. From 17 bar to 23 bar, 
there are substantial changes to both the positions and intensities of 
the diffraction peaks, as peaks corresponding to the collapsed phase 
decrease in intensity and peaks corresponding to a new expanded phase 
increase in intensity (Fig. 2a). During desorption, this expanded phase is 
fully converted back to the collapsed phase between 10 bar and 5 bar. 
Owing to the anisotropic peak widths and complex peak shapes that 
result from paracrystallinity effects!*, analysis of the powder diffraction 
data is not trivial, but ab initio structure solutions followed by Rietveld 
refinements (Extended Data Fig. 7) were successfully performed using 
the diffraction data at Obar and 30 bar to provide crystal structures of 
the collapsed and expanded phases of Co(bdp) (Fig. 2d). As discussed 
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Figure 3 | Variable-temperature equilibrium isotherms and differential 
enthalpies. a, b, Total CH, adsorption isotherms at various temperatures 
for Co(bdp) (a) and Fe(bdp) (b), where a minimum desorption pressure 

of 5.8 bar and a maximum adsorption pressure of 35 bar are indicated by 
dashed grey lines. Filled circles represent adsorption; open circles represent 
desorption. c, Differential enthalpies of CH4 adsorption (haa) for Co(bdp), 


in the Supplementary Information and shown in Extended Data 
Fig. 8, paracrystallinity arises from highly correlated shifts of the 
positions of Co-pyrazolate chains in the crystallographic a—b plane, 
whereby neighbouring chains exhibit average displacements of approx- 
imately 0.5 A from their average periodic positions. Importantly, this 
minor systematic disordering has no effect on the accuracy of the aver- 
age crystal structures or the calculated crystallographic densities of 
each phase. Additionally, a substantial diffuse-scattering component 
is present in the experimental diffraction patterns, particularly at high 
CH, loadings. Although most of the diffuse scattering can be attrib- 
uted to the thick-walled quartz glass capillaries used as sample holders 
in the diffraction experiments at high CH, pressures (Supplementary 
Fig. 11), there may also be some diffuse scattering that is intrinsic to 
Co(bdp), which could arise from minor local disorder or from scatter- 
ing by adsorbed CH, molecules. 

Even though the density of the collapsed phase (1.50 g cm~%) is nearly 
double that of the expanded phase (0.77 g cm~>), the Co”* ions adopt 
a similar pseudotetrahedral geometry in both structures. During the 
phase transition, the angles between the planes of the pyrazolate rings 
and the Co-N bonds decrease as the framework expands (Extended 
Data Fig. 6). In addition, the central benzene ring of the bdp”~ ligand 
twists out of the plane of the two pyrazolates by 25° in the collapsed 
structure, resulting in edge-to-face 1-1 interactions with four neigh- 
bouring benzene rings that probably provide most of the thermo- 
dynamic driving force for the collapse of Co(bdp) at low pressures 
(Fig. 2e)”°. These close contacts between neighbouring bdp?~ ligands 
lead to no accessible porosity, and thus no CH, adsorption, in the 
collapsed phase. 

The usable CH, capacity of Co(bdp) at 25°C is 155 cm? STP cm? 
(v/v) for adsorption at 35 bar and 197 v/v for adsorption at 65 bar, which 
are the highest values of usable CH, capacity reported so far for any 
adsorbent under these conditions. A recent computational analysis of a 
database containing over 650,000 classical adsorbents predicted a the- 
oretical-maximum 65-bar usable capacity of 196 v/v (ref. 9); however, 


0 2 4 6 8 
Total CH, adsorbed (mmol g-’) 


as determined from variable-temperature adsorption isotherms (purple 
line) and three separate microcalorimetry experiments (open symbols). 

d, Differential enthalpies of CHy adsorption (has) for Fe(bdp), as 
determined from variable-temperature adsorption isotherms. Dashed grey 
lines in c and d indicate the amount of CH, adsorbed at 5.8 bar and 35 bar. 


all adsorbents in this large-scale computational screening were rigid, 
and the potential utility of flexible adsorbents for CH, storage was not 
considered. The Co(bdp) usable capacities reported here are a result 
of the transition from the expanded to the collapsed phase leading to 
near complete CH, desorption by 5.8 bar. For comparison, the high- 
est previously measured 35-bar and 65-bar usable capacities for any 
adsorbent are 143 v/v and 189 v/v, as obtained for the metal-organic 
frameworks HKUST-1 and UTSA-76a, respectively’*"!, Both of these 
Cuz paddlewheel-based frameworks have high densities of CH, adsorp- 
tion sites with a near-optimal (for maximizing the usable CH, capacity 
for ambient temperature adsorption at pressures between 35 bar and 
65 bar) binding enthalpy of —15kJ mol~! to —17kJ mol“, but display 
Langmuir-type adsorption isotherms that leave a substantial amount 
of unusable CH, adsorbed at 5.8 bar. 

One major, and often overlooked, challenge in developing adsor- 
bents for natural gas storage, or indeed for any gas storage appli- 
cation, involves managing the exothermic heat of adsorption and 
endothermic heat of desorption, both of which reduce the usable 
capacity of an adsorbent. These heat effects can be substantial, with 
temperature changes of as much as 80°C observed during testing 
of prototype activated carbon-based ANG systems, and result in 
large reductions in the usable CH, capacity*””?. On-board ther- 
mal-management systems are essential to minimizing the negative 
impacts of the heats of sorption, but these engineering controls take 
up already limited space on a vehicle and add considerable cost and 
complexity”*. 

Responsive adsorbents, such as Co(bdp), offer the possibility of 
managing heat intrinsically within a material, rather than through an 
external system, by using the enthalpy change of a phase transition 
to partially, or perhaps even fully, offset the heats of sorption. For 
Co(bdp), the expansion of the framework during adsorption is endo- 
thermic, because energy is needed to overcome the greater thermody- 
namic stability of the collapsed phase. As a result, some of the enthalpy 
of CH, adsorption should go towards providing the heat needed for 
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Figure 4 | Effect of mechanical pressure on CH, storage in Co(bdp). 

a, Space-filling models of collapsed (left) and CH4-expanded (right) 
Co(bdp); purple, grey, blue, and white spheres represent Co, C, N, and H 
atoms, respectively. b, Excess CH, adsorption isotherms for Co(bdp) at 
25°C with different levels of applied external mechanical pressure, indicated 
by the inset, colour-coded bulk powder densities, with higher densities 
corresponding to greater applied mechanical pressure. The maximum 

CH, pressure for which hysteresis is still present is indicated for each bulk 
density by the appropriately coloured dashed line. Filled circles represent 
adsorption; open circles represent desorption. 


the transition to the expanded phase, lowering the overall amount of 
heat released compared to adsorption in the absence of a phase tran- 
sition. Similarly, the transition to the collapsed phase is exothermic, 
and some of the heat released by the framework as it collapses should 
offset the endothermic desorption of CH,. 

In classical porous materials, low-coverage differential CH, adsorp- 
tion enthalpies are generally —12 kJ mol~! CH, to —15kJ mol! CH, for 
adsorbents that do not have any strong CH, binding sites and are closer 
to —15kJ mol~! to —25kJ mol! for adsorbents with the highest volu- 
metric CH, capacities”®. For the steepest region of the CH, adsorption 
isotherm of Co(bdp), the differential enthalpy is considerably lower, at 
just —8.4(3) kJ mol”! (where the uncertainty corresponds to +1 stand- 
ard deviation), because the endothermic framework expansion partially 
offsets the exothermic heat of adsorption (Fig. 3c). After the transition 
to the expanded Co(bdp) phase is complete, the differential enthalpy 
approaches —13kJ mol !, which is consistent with weak CH, physical 
adsorption in the absence of a phase transition to mitigate heat. To 
confirm the accuracy of the calculated differential enthalpies, the heat 
released during CH, adsorption was directly measured by performing 
variable-pressure microcalorimetry experiments. As shown in Fig. 3c, 
the differential enthalpies obtained from calorimetry are in excellent 
agreement with those calculated from the variable-temperature adsorp- 
tion isotherms. 

The total amount of heat released when increasing the pressure of 
CH, adsorbed in Co(bdp) from 5.8 bar to 35 bar, as would occur during 
refuelling of an ANG vehicle, is calculated by integrating the differential- 
enthalpy curve with respect to the amount of CH, adsorbed. The 73.4kJ 
of heat released per litre of Co(bdp) represents a 33% reduction rela- 
tive to the 109 kJ 17! of heat released by HKUST-1 under the same 
conditions, even though the amount of CH, adsorbed in Co(bdp) 
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is 8% greater. We further calculate that 93.9 kJ 1"! of heat would be 
released for hypothetical CH, adsorption in a rigid Co(bdp) frame- 
work—28% higher than when adsorption occurs with a phase transi- 
tion to provide heat mitigation”®. 

By chemically modifying Co(bdp), we hypothesized that it might 
be possible to obtain a new flexible framework with a similar 
stepped CH, isotherm, but a higher-energy phase transition that 
could provide even greater intrinsic heat management. Because one- 
dimensional chains are known to form with tetrahedral Fe** ions bridged 
by u’-pyrazolates”®, we anticipated that it might be possible to synthe- 
size an isostructural iron analogue of Co(bdp). By heating FeCl, and 
Hbdp in a mixture of N,N-dimethylformamide (DMF) and methanol, 
we indeed obtained Fe(bdp) as yellow, block-shaped crystals. X-ray anal- 
ysis of a DMF-solvated crystal (Extended Data Fig. 6) confirmed that 
Fe(bdp) is isostructural to Co(bdp). Fe(bdp) has a stepped high-pressure 
CH, isotherm at 25°C (Fig. 1d), suggesting that this new compound 
also undergoes a reversible phase transition between a collapsed and 
expanded framework. Although the total CH, uptake is comparable to 
that of Co(bdp), the adsorption and desorption steps occur at the con- 
siderably higher pressures of 24bar and 10bar, respectively, suggesting 
that replacing Co with Fe increases the energy of the phase transition. 

In situ powder X-ray diffraction experiments from 0 bar to 50 bar 
of CH, (Fig. 2b) and subsequent Rietveld refinements afforded the 
collapsed and CH,-expanded crystal structures of Fe(bdp). Although 
the collapsed phase is nearly identical to that of Co(bdp), with edge- 
to-face m-m interactions and no accessible porosity, the volume of the 
expanded Fe(bdp) phase at 40 bar is 9% greater than that of Co(bdp) 
(Fig. 2f). In contrast to Co(bdp), we observe a second transition for 
Fe(bdp) at pressures above 40 bar, wherein Fe(bdp) slightly expands to a 
framework with nearly perfect square channels (Extended Data Fig. 6). 
In spite of its greater expansion and lower crystallographic density, the 
usable CH, capacity of Fe(bdp) is still higher than all known adsorbents 
at 150 v/v and 190 v/v for 35 bar and 65 bar adsorption, respectively. 

Although Fe(bdp) and Co(bdp) have similar usable capacities, the 
initial Fe(bdp) phase transition offsets more heat, and only 64.3 kJ of 
heat is released per litre of adsorbent during CH, adsorption at 35 bar, 
which is 12% lower than for Co(bdp) and 41% lower than for HKUST-1. 
This is a direct consequence of the larger increase in the enthalpy of 
Fe(bdp) (8.1 kJ mol~') than of Co(bdp) (7.0kJ mol~!) during the phase 
transition, which mitigates more heat of adsorption, thereby provid- 
ing a greater source of intrinsic thermal management. This result 
demonstrates how a slight variation in the metal-organic framework 
can be used to improve its intrinsic thermal management, and it is 
very likely that similar effects will prove possible through alteration of 
the bdp*~ bridging ligand. 

Examining the temperature dependence of the CH, isotherms of 
Co(bdp) and Fe(bdp) (Extended Data Figs 2, 3) reveals another advan- 
tage of these materials, involving a reduction in the effect of cooling 
during desorption. Consistent with other gate-opening metal-organic 
frameworks, the CH, adsorption and desorption steps in Co(bdp) 
and Fe(bdp) shift to lower pressures at lower temperatures (Fig. 3a, b). 
As long as the temperature stays above 0°C in Co(bdp) or —25°C in 
Fe(bdp), however, the transition to the collapsed phase occurs above 
5.8 bar, and the usable CH, capacity will not be affected by cooling 
(Supplementary Tables 2, 3). This property has practical benefits for 
driving in cold-weather climates and should further reduce the overall 
thermal management required in an ANG system. 

Recent work?’~”? has shown that it is possible to induce a phase 
transition in flexible metal-organic frameworks by applying external 
mechanical pressure. With this in mind, we proposed that applying 
moderate mechanical pressure could provide a means of further tun- 
ing the CH, adsorption and desorption step pressures in Co(bdp) and 
Fe(bdp) and of increasing the energy of the phase transition to offset 
more heat. To investigate this concept, high-pressure CH, adsorption 
isotherms were measured for Co(bdp) at different levels of applied uni- 
axial mechanical pressure. 
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At higher mechanical pressures and higher compaction densities, 
both the adsorption and desorption isotherm steps shift to higher 
CH, pressures, which is consistent with an increase in the energy of 
the phase transition (Fig. 4). In addition, the isotherm hysteresis loop 
remains open until higher CH, pressures, with hysteresis observed to 
pressures of at least 70 bar for the highest applied mechanical pressure. 
Because hysteresis at a given pressure implies that a phase transition 
is still occurring*®, this result suggests that some Co(bdp) crystallites 
are expanding at much higher CH, pressures when under an applied 
external mechanical pressure. Because Co(bdp) crystallites in a bulk 
powder will be at different orientations with respect to the direction 
of uniaxial compression (Extended Data Fig. 4), there will be a distri- 
bution of local mechanical pressures experienced by different crystal- 
lites. Crystallites that experience higher external pressures will have a 
greater free energy change associated with the phase transition and will 
open at higher pressures*!. Overall, these results present the prospect 
of using mechanical work, such as provided through an elastic bladder, 
as a means of thermal management in a gas-storage system based on 
a flexible adsorbent. 

Designing new flexible adsorbents with stronger gas binding sites 
and higher-energy phase transitions provides a promising route to 
achieving even higher usable capacities and greater intrinsic heat 
management in a next generation of gas-storage materials. Moreover, 
improved compaction and packing strategies should allow further 
reductions to external thermal-management requirements and opti- 
mization of the overall storage-system performance. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Materials. Anhydrous N,N-dimethylformamide (DMF) was obtained from a JC 
Meyer solvent system. The ligand 1,4-benzenedipyrazole (H2bdp) was synthesized 
according to a literature procedure’. All other reagents were obtained from com- 
mercial vendors and used without further purification. Ultra-high purity grade 
(99.999% purity) helium, dinitrogen, and methane were used for all adsorption 
measurements. 

Synthesis of Co(bdp). The compound Co(bdp) was synthesized using a strat- 
egy adopted from a previous report!”. Specifically, a 500-ml solvent bomb was 
charged with a magnetic stirring bar, Co(CF3SO3)2 (4.96 g, 0.0139 mol), H2bdp 
(2.46 g, 0.0117 mol), and N,N-diethylformamide (90 ml). The reaction mixture 
was degassed by the freeze-pump-thaw method for 5 cycles then sealed by clos- 
ing the stopcock of the solvent bomb while the frozen reaction mixture was still 
under vacuum. The solvent bomb was then heated at 160°C for 4.5 days to afford 
a purple microcrystalline solid. The solvent bomb was backfilled with N>, and 
the solid was collected by filtration. Before drying, the wet solid powder was 
immediately transferred to a 500-ml glass jar, and 400 ml of DMF was added. The 
jar was heated at 120°C for 12h, then cooled to room temperature. The DMF was 
decanted and replaced with 400 ml of fresh DME. The jar was reheated at 120°C, 
followed by decanting and replacing with fresh DMF. This was repeated four 
additional times. The DMF was then decanted and replaced with dichloromethane 
(DCM). The DCM was partially decanted until 50 ml of solution was remaining. 
The resultant slurry was transferred to a 100-ml Schlenk flask, and the DCM was 
evaporated by flowing N> at room temperature. The resultant solid was dried by 
flowing N2 at 160°C for 12h, then placed under dynamic vacuum at 160°C for 
24h. The activated solid was immediately transferred to a glovebox and handled 
under a N> atmosphere for all further experiments. 

Synthesis of Fe(bdp). In a glovebox under a N2 atmosphere, Habdp (0.200 g, 
0.95 mmol) in DMF (9 ml) was heated to 120°C while stirring for 20 min in a 
20 ml glass vial. The resultant yellow suspension was cooled. A solution of FeCl, 
(0.197 g, 1.55 mmol) in methanol (1 ml) was added to the cooled suspension of 
H,(bdp) in DMF, and the vial was sealed and heated at 120°C while stirring. The 
hot, orange-yellow solution yielded a yellow microcrystalline powder after several 
hours. Samples suitable for gas adsorption studies were prepared using multiple 
vials of the same reaction scale in a glovebox under a N> atmosphere and by 
washing the resultant material nine times with hot DMF (9 x 18 ml), before drying 
under high vacuum at 170°C for 24h. The activated sample was handled under a 
N) atmosphere for all further experiments. IR (neat, cm~'): 1,573 (s), 1,336 (w), 
1,239 (s), 1,110 (s), 1,041 (s), 952 (s), 859 (s), 849 (s), 832 (s), 824 (s), 644 (s), 
534 (s). Anal. Calcd for FeC}zHgNu: C, 54.58; H, 3.05; N, 21.22. Found: C, 54.18; 
H, 2.36; N, 20.67. To obtain single crystals suitable for X-ray diffraction, a 9:1 
mixture of DMF and methanol was used to create solutions of FeCl, (9.0 mg, 
0.07 mmol in 0.1 ml solvent) and Habdp (4.0 mg, 0.019 mmol in 0.9 ml solvent). 
The FeCl solution and the H2bdp solution were added together in a 4-ml vial. The 
vial was then sealed, and the clear yellow solution was heated at 120°C for 24h. 
Block-shaped yellow crystals formed on the sides of the vial after several hours. 
Low-pressure gas adsorption measurements. Gas adsorption isotherms for pres- 
sures in the range of 0-1.1 bar were measured using a Micromeritics ASAP 2020 
or 2420 instrument. Activated samples were transferred under a N> atmosphere 
to preweighed analysis tubes, which were capped with a Transeal. Each sample 
was evacuated on the ASAP until the outgas rate was less than 3 ubar min’. The 
evacuated analysis tube containing degassed sample was then carefully transferred 
to an electronic balance and weighed to determine the mass of sample (typically 
100-200 mg). The tube was then fitted with an isothermal jacket and transferred 
back to the analysis port of the ASAP. The outgas rate was again confirmed to 
be less than 3 ubar min” '. Langmuir surface areas were determined by meas- 
uring N> adsorption isotherms in a 77-K liquid N> bath and calculated using 
the Micromeritics software, assuming a value of 16.2 A? for the molecular cross- 
sectional area of N>. The Langmuir surface areas of Co(bdp) and Fe(bdp) are 
2,911 m? g! and 2,780 m? g“ |, respectively. Full 77-K N> adsorption isotherms 
for Co(bdp) and Fe(bdp) can be found in Supplementary Fig. 1. Note that BET 
surface areas cannot be accurately determined for either framework because of the 
steps in the low-pressure region of the 77-K N2 adsorption isotherms. 
High-pressure CH, adsorption measurements. High-pressure CH, adsorp- 
tion isotherms in the range of 0-70 bar were measured on an HPVA-II-100 from 
Particulate Systems, a Micromeritics company. In a typical measurement, 0.5-1.0g 
of activated sample was loaded into a tared stainless steel sample holder inside a 
glovebox under a N2 atmosphere. Prior to connecting the sample holder to the 
VCR fittings of the complete high-pressure assembly inside the glovebox, the 
sample holder was weighed to determine the sample mass. The sample holder 
was then transferred to the HPVA-II-100, connected to the instrument’s analysis 
port via an OCR fitting, and evacuated at room temperature for at least 2h. The 


sample holder was placed inside an aluminium recirculating Dewar connected 
to a Julabo FP89-HL isothermal bath filled with Julabo Thermal C2 fluid. The 
temperature stability of the isothermal bath is +0.02°C. Methods for accurately 
measuring the relevant sample freespaces, which involve the expansion of He 
from a calibrated volume at 0.7 bar and 25°C to the evacuated sample holder, were 
described in detail previously*. Non-ideality corrections were performed using 
the CH, compressibility factors tabulated in the NIST REFPROP database* at 
each measured temperature and pressure. 

A sample size of 1.032 g was used for the 25-°C usable capacity calculations, 
compaction studies, and cycling studies with Co(bdp), whereas a sample size of 
0.584 g was used for the variable-temperature measurements. For Fe(bdp), a sam- 
ple size of 0.274 g was used for high-pressure adsorption measurements, with the 
exception of the isotherms measured at —12°C and —25°C for which a sample 
size of 0.322 g was used. 

To determine the usable CH, capacity of Co(bdp) and Fe(bdp), experimen- 

tally measured excess gravimetric adsorption data (Extended Data Fig. 3) were 
converted to total volumetric adsorption data using the pore volume and crys- 
tallographic density of the CH4-expanded phases (see Supplementary Text for 
details). All usable capacity calculations assume a minimum desorption pressure 
of 5.8 bar. Although the minimum desorption pressure required for natural gas to 
flow from the adsorbent to the combustion engine can vary from 3.5 bar to 10 bar 
depending on the specific requirements of fuel injectors, filters, and other engine 
components, a value of 5.8 bar has been adopted by many groups”! for initial 
materials comparisons. 
High-pressure CH, adsorption measurements under applied mechanical 
pressure. For the high-pressure CH, adsorption measurements of Co(bdp) at 
different applied mechanical pressures, an aluminium sample holder was designed 
and used (Extended Data Fig. 5). The sample is loaded in the volume between 
the fritted and blank gaskets. The free volume between the fritted and blank 
gaskets in the absence of a sample was determined by expansion of He from a 
calibrated volume to be 5.242 ml. Initially, 1.032 g of Co(bdp) was loaded into this 
volume, resulting in a bulk density of 0.197 g ml! for the uncompacted powder. 
After measuring a high-pressure CH, adsorption isotherm, the sample holder 
was returned to a glovebox under a N2 atmosphere, and the cell was opened by 
removing the cap behind the blank gasket. An aluminium rod with an outer diam- 
eter slightly less than the inner diameter of the sample holder was then inserted. A 
mechanical press was used to compact the sample by pushing down on the rod. A 
fresh blank gasket was then sealed behind the rod so that the rod was left pressed 
against the sample, with a continuously applied uniaxial mechanical pressure. The 
sample holder was returned to the high-pressure instrument and fully evacuated 
before measuring a high-pressure CH, adsorption isotherm. This experiment was 
repeated after inserting additional metal rods to further compact the Co(bdp), 
increase the applied mechanical pressure, and reduce the sample volume. Packing 
densities for each experiment were calculated by subtracting the volume of each 
rod from the original sample volume. 

The decrease in the total amount of CH, adsorbed at higher mechanical pres- 

sures (Fig. 4) is not due to framework degradation, as is often observed when 
compacting classical adsorbents’, and can instead be explained by insufficient CHy 
pressure to induce a phase transition in some crystallites and by a lack of sufficient 
free volume for all crystallites to expand into. To confirm this, a CH, adsorption 
isotherm was measured after compacting collapsed Co(bdp) to a packing density 
of 0.75 g cm 3, which is just below the crystallographic density of the expanded 
phase, and releasing the applied mechanical pressure by removing the metal rod. 
The resulting isotherm was found to be nearly identical to the pre-compaction 
isotherm, demonstrating that all Co(bdp) crystallites could once again fully expand 
(Extended Data Fig. 5). 
Powder X-ray diffraction measurements. Powder X-ray diffraction data for 
Co(bdp) and Fe(bdp) were collected on beamline 17-BM-B at the Advanced 
Photon Source (APS) at Argonne National Laboratory and beamline MS-X04SA 
at the Swiss Light Source (SLS) at the Paul Scherrer Institut (Extended Data Fig. 
7). For variable CH, pressure experiments, approximately 10 mg of fully desol- 
vated framework was loaded into 1.5-mm quartz glass capillaries inside a glove- 
box under a N2 atmosphere. Each capillary was attached to a custom-designed 
gas-dosing cell, which is equipped with a gas valve, and was then transferred to 
the goniometer head. All adsorbed N» was removed by evacuating in situ using 
a turbomolecular pump. A cryostat was used to hold the temperature constant 
at 25°C, and variable pressures of CH, were dosed to the samples. Diffraction 
data were collected after allowing each dose to equilibrate for several minutes. 
All X-ray wavelengths were between 0.72 A and 0.78 A, and are specified for each 
experiment in the relevant figures and tables. 

The structure solution and refinement procedure used in this study followed the 
standard protocol developed by us, and others, for the structural characterization 
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of polycrystalline samples of non-ideal crystallinity and moderately complex struc- 
tures by ab initio powder diffraction methods. The specific details of the crystal 
structure determinations are discussed in the Supplementary Information, but 
the general procedure, which was fully adopted for the collapsed Co(bdp) phase, 
is summarized here. A standard peak search, followed by peak profile fitting was 
first used to determine accurate peak positions of several well-separated low-angle 
peaks. These peak positions were used to obtain approximate lattice parameters via 
the single-value-decomposition indexing procedure implemented in the software 
TOPAS-R (Bruker AXS, version 3.0, 2005), which were later refined by the struc- 
tureless Le Bail method as implemented in TOPAS-R. Systematic absences, density 
considerations and previous knowledge of isotypic species coherently allowed the 
derivation of the correct space group, which was later confirmed by successful 
structure solution and Rietveld refinement. The structural model was derived using 
the simulated annealing procedure as implemented in TOPAS-R, which is a real- 
space structure-solution technique, with a single freely floating metal ion and an 
idealized half bdp*~ ligand defined using z-matrix formalism. In the collapsed 
phase of Co(bdp), for instance, the metal atom was located on a two-fold axis at 
(0, y, 1/4), whereas the half bdp?- ligand was hinged about the inversion centre at 
(1/4, 3/4, 0). Once an initial structural model was established, complete Rietveld 
refinements were performed in the software TOPAS-R. The background was mod- 
elled with Chebyshev polynomials, and Lorentz and absorption correction factors 
were applied. A single isotropic B value was attributed to all atoms, and found to 
act, as expected, as a scavenger for 0-dependent systematic errors, which are not 
suitably taken into account in the data-reduction process. After the retrieval of the 
lattice metrics and space-group symmetry for the CH4-expanded phases, defining 
a starting structural model was straightforward, because it is implicit in the iso- 
typic nature of the compounds. The contribution of the (probably tumbling, but 
not necessarily randomly located) CH, molecules to the overall scattering power 
was neglected, which probably contributes to the decreased physical meaning of 
the atomic displacement parameter values, as is common for crystal structures 
determined from powder diffraction data. 

As indicated in the main text, the peak widths of the collapsed Co(bdp) and 
Fe(bdp) could not be modelled by convoluting conventional Lorentzian and 
Gaussian functions (or their combinations) with systematic 1/cos(@) or tan(0) 
dependency, respectively, or with smooth hkl-dependent models (such as spher- 
ical harmonics). Instead, we began by separately modelling the hkl peak widths 
as distinct from the axial reflections of the h00- and 0k0-type peak widths using a 
purely phenomenological model. We also developed a paracrystalline model for 
collapsed Co(bdp) (discussed in Supplementary Text) that is also representative 
of the collapsed Fe(bdp) phase. 

Single-crystal X-ray diffraction measurements. X-ray diffraction analyses were 
performed on a single crystal of Fe(bdp) that was coated with Paratone-N oil 
and mounted on a MiTeGen loop. The crystal of Fe(bdp) was first kept frozen at 
100 K by an Oxford Cryosystems Cryostream 800 plus, and after a full data col- 
lection, the crystal was warmed to 298 K for a second data collection. Diffraction 
data for Fe(bdp) was collected at beamline 11.3.1 at the Advanced Light Source, 
Lawrence Berkeley National Laboratory using synchrotron radiation (wavelength 
A= 0.7749 A) with 1° omega scans for the 100-K structure, and 4° phi and 1° 
omega scans for the 298 K structure. A Bruker PHOTON100 CMOS diffractom- 
eter was used for data collection, and the corresponding Bruker AXS APEX II 
software was used for data collection and reduction. Raw data were integrated 
and corrected for Lorentz and polarization effects using the Bruker AXS SAINT 
software. Absorption corrections were applied using TWINABS for the 100-K 
structure and SADABS for the 298-K structure. Space-group assignments were 
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determined by examination of systematic absences, E statistics, and successive 
refinement of the structures of Fe(bdp) at 100 K and 298 K. The structures were 
solved using direct methods with SHELXS and refined using SHELXL operated 
in the OLEX2 interface. Thermal parameters were refined anisotropically for all 
non-hydrogen atoms. Hydrogen atoms were placed in ideal positions and refined 
using a riding model for all structures. 

The crystal was determined to be twinned at 100K and a suitable unit cell was 
determined that is similar to that previously reported for Co(bdp)e3DMF in the 
space group P2,/c (ref. 33). The program CELL_NOW was used to determine the 
orientation matrices, and the domains were found to related by a 179.9° rotation 
around the reciprocal axis [0.5, 0, 1]. Raw data for both matrices were integrated 
and corrected for absorption using TWINABS. Solution and refinement of the 
data in P2,/c required substantially fewer restraints in structure refinement and 
gave much lower values for R, compared to those solved in other space groups. 
Solvent molecules could be refined anisotropically in the crystal of Fe(bdp) at 
100 K, accounting for all pore void space. 

When the crystal was warmed to 298 K, the space group was determined to 
be C222, instead of P2;/c and was refined as an inversion twin (batch scale fac- 
tor, BASF = 0.52(4); the uncertainty corresponds to +1 s.d.). At 298 K, there was 
extensive solvent disorder that could not be modelled. A solvent mask was applied, 
as implemented in OLEX2, to account for unassigned electron density within the 
pores. The loss in intensity of spots upon warming to 298 K, and the large aniso- 
tropic displacement parameters that result from linker and solvent disorder, gave 
rise to A- and B-level alerts from checkCIF (http://checkcif.iucr.org). Responses 
addressing these alerts have been included in the CIF (crystallographic informa- 
tion file, available from the Cambridge Crystallographic Data Centre; see Author 
Information) and can be read in reports generated by checkCIEF. 

Scanning electron microscopy. Scanning electron microscopy (SEM) samples of 
Co(bdp) and Fe(bdp) were prepared by dispersing microcrystalline powders into 
DCM and drop casting onto a silicon chip (Extended Data Fig. 4). To dissipate charge, 
the samples were sputter coated with approximately 3nm of Au (Denton Vacuum). 
Crystals were imaged at 5 keV and 121A by field emission SEM (JEOL FSM6430). 
Microcalorimetry measurements. Approximately 0.2 g of Co(bdp) was used for 
combined microcalorimetry and high-pressure CH, adsorption experiments. 
Before each experiment, samples were outgassed ex situ at 423 K for 16h under 
a dynamic vacuum of 10? mbar. The microcalorimetry experiments were per- 
formed using a custom-built manometric adsorption apparatus coupled with 
a Tian—Calvet-type microcalorimeter**. This experimental device allows the 
simultaneous determination of the adsorption isotherm and the adsorption 
enthalpy using a point-by-point introduction of gas to the sample. A multi- 
pneumovalve system allows the introduction of the adsorbate to the sample. 
An exothermic thermal effect accompanied each introduction, which is due 
to both the adsorption process and gas compression. This peak in the energy 
curve with time is thus integrated to calculate a pseudo-differential enthalpy 
of adsorption for each dose. Errors in this calculation can be estimated at 
+1kJ mol!. Experiments were carried out at 303 K and up to 70 bar with CHy 
of a purity of above 99.999%. 


32. Lemmon, E. W., Huber, M. L. & McLinden, M. O. NIST Standard Reference 
Database 23: Reference Fluid Thermodynamic and Transport Properties— 
REFPROP Version 8.0 (National Institute of Standards and Technology, 2007). 

33. Lu, Y. et al. A cobalt(Il)-containing metal-organic framework showing catalytic 
activity in oxidation reactions. Z. Anorg. Allg. Chem. 634, 2411-2417 (2008). 

34. Llewellyn, P. L. & Maurin, G. Gas adsorption microcalorimetry and modeling to 
characterise zeolites and related materials. C. R. Chimie 8, 283-302 (2005). 


© 2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


= 
oO 


© 


Excess CH, adsorbed (mmol/g) 


0 10 20 30 40 50 60 70 
Pressure (bar) 


e@ O cycles 

e 25 cycles 
e 50 cycles 
e 75 cycles 
e@ 100 cycles 


Excess CH, adsorbed (mmol/g) © 


0 10 20 30 40 50 60 += 70 
Pressure (bar) 


Extended Data Figure 1 | High-pressure CH, cycling. a, Excess 

CH, isotherms at 25°C for Co(bdp) repeated four times on the same 
sample, which was regenerated under vacuum at 25°C for 2h between 
measurements. The adsorption step is at a slightly higher pressure during 
the first run because there is probably a slightly higher energy barrier to 
the first expansion of a freshly packed sample; however, the desorption 
steps occur at identical pressures for all four runs. b, The adsorption and 
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desorption pressures are shown as green and red circles, respectively, for 

100 CHy adsorption-desorption cycles in Co(bdp) at 25°C. ¢, Excess CHy 
adsorption isotherms at 25°C for Co(bdp) after 0, 25, 50, 75, and 100 

cycles of 35-bar adsorption and 5-bar desorption. d, Excess CH, isotherms 
at 25°C for Co(bdp) before (green) and after (blue) the 100 adsorption— 
desorption cycles between 35 bar and 5 bar. Filled and open circles ina and d 
correspond to adsorption and desorption, respectively. 
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Extended Data Figure 2 | Adsorption isotherm fitting. a, Total CH, 
adsorption isotherms at 0°C, 12°C, 25°C, 38°C, and 50°C for Co(bdp), 
with adsorption after the step fitted independently at each temperature 
with an offset dual-site Langmuir—Freundlich equation. The small 

pre-step adsorption was fitted with a single-site Langmuir model. b, Total 
CH, adsorption isotherms at 0°C, 12°C, 25°C, 38°C, and 50°C for Fe(bdp) 
with adsorption after the phase transition fitted independently at each 
temperature with an offset dual-site Langmuir—Freundlich equation. The 
pre-step adsorption was fitted with a single-site Langmuir model, and the 
isotherms were only fitted to a maximum loading of 10.6 mmol g“', as 
indicated by the shading, to avoid complications from the second transition 
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at higher CH, loadings. As such, differential enthalpies are only calculated 
up to a maximum loading of 10.6 mmol g“!. c, Total CH, adsorption 
isotherms at 0°C, 12°C, 25°C, 38°C, and 50°C for Co(bdp) with the 
corresponding single-site Langmuir fit for CH, adsorption in the expanded 
phase. d, Total CH, adsorption isotherms at 0°C, 12°C, 25°C, 38°C, and 
50°C for Fe(bdp) with the corresponding single-site Langmuir fit for CH, 
adsorption in the 40-bar expanded phase. The data were only fitted for 

the region of the isotherms that falls after the initial hysteresis loop closes 
and before the second isotherm step. All single- and dual-site Langmuir- 
Freundlich fits are shown as black lines. 
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Extended Data Figure 3 | Excess CH, adsorption data. a, Excess CH, adsorption isotherms at — 25°C, 0°C, 12°C, 25°C, 38°C, and 50°C for Co(bdp). 
b, Excess CH, adsorption isotherms at — 25°C, — 12°C, 0°C, 12°C, 25°C, 38°C, and 50°C for Fe(bdp). Filled and open circles correspond to adsorption and 


desorption, respectively. 
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Extended Data Figure 4 | SEM images. a, SEM image of DMF-solvated Co(bdp) microcrystalline powder. Scale bar, 10 um. b, SEM image of Co(bdp) 
microcrystalline powder after more than 100 CH, adsorption-desorption cycles. Scale bar, 1 um. c, SEM image of desolvated Fe(bdp) microcrystalline 
powder. Scale bar, 1 um. 
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Extended Data Figure 5 | Effect of mechanical pressure. a, Sample holder pressure (and constricted volume) is maintained throughout the high- 
used for combined applied mechanical pressure and high-pressure CH4 pressure CH, adsorption experiment. b, Excess CH, isotherms at 25°C for 
adsorption experiments. The sample is located in the volume to the right Co(bdp) before (green) and after (purple) the applied mechanical pressure 
of the fritted gasket and to the left of the blank gasket. A press is used to studies. Filled and open circles correspond to adsorption and desorption, 
compact metal rods of different lengths against the sample, and the blank respectively. 
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Extended Data Figure 6 | Solid-state structures. a, b, The angles between of Fe(bdp). d, Structure of the collapsed phase of Fe(bdp) under vacuum 
the plane of the pyrazolate (light orange) and the Co-N-N-Co plane (light at 25°C. e, Structure of the DMF-solvated phase of Fe(bdp) at 100K. 

blue) are 38.1° and 17.3° in the collapsed and the CH4-expanded phases of f, Idealized average structure of the 50-bar CH4-expanded phase of Fe(bdp) 
Co(bdp), respectively. c, The angle between the plane of the pyrazolate (light at 25°C. In a-f, Grey, blue, red, purple, and orange spheres represent C, N, 
orange) and the Fe-N-N-Fe plane (light blue) is 40.1° in the collapsed phase _—_O, Co, and Fe atoms, respectively; H atoms are omitted for clarity. 
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Extended Data Figure 7 | Powder X-ray diffraction. a-d, Rietveld 
refinements for powder X-ray diffraction data (26 is the diffraction 

angle) for Co(bdp) at 25°C and under vacuum with A = 0.77475 A (a), for 
Co(bdp) at 30 bar of CHy and 25°C with A = 0.75009 A (b), for Fe(bdp) 
under vacuum at 25°C with A = 0.72768 A (c), and for Fe(bdp) at 40 bar 
of CH, and 25°C with A = 0.72768 A (d). Red and blue lines represent 

the observed and calculated diffraction patterns, respectively. Grey lines 
represent the difference between observed and calculated patterns, and 
green and orange tick marks indicate calculated Bragg peak positions. The 
broad hump observed at 10° in the diffraction patterns is due to diffuse 
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scattering from the sample holder (a thick-walled quartz glass capillary). 
The insets are magnified views of the main plots. e, Powder X-ray diffraction 
data for Fe(bdp) at 50 bar of CH, and 25°C (A = 0.72768 A). Green tick 
marks indicate Bragg angles for space-group-permitted reflections; the 
corresponding Miller indices are indicated for the most prominent peaks. 
Blue arrows indicate broad humps where multiple reflections overlap. f, The 
percentage of the expanded phase of Co(bdp) that is present in the variable- 
pressure experimental powder X-ray diffraction patterns as a function of 
CH, pressure. The filled squares represent data collected during adsorption; 
the open squares represent data collected during desorption. 
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Extended Data Figure 8 | Paracrystalline model. a, An illustration of the 
paracrystalline distortion in the crystallographic a—b plane of the collapsed 
phases of Co(bdp) and Fe(bdp) that leads to complex Bragg peak 
broadening. Black dashed lines represent the periodic crystal lattice; blue 
lines represent the paracrystal. Red circles represent the positions of metal- 
pyrazolate chains in the periodic lattice; blue circles represent their positions 
ina paracrystal. The magnitude of the paracrystalline distortion has been 
exaggerated for clarity. b, Simulated diffraction patterns are shown for a 
periodic collapsed Co(bdp) nanocrystal (75 nm x 60nm x 43 nm; red trace) 


and for a paracrystal of equivalent size (blue trace). The upper trace (black) 
corresponds to the background-subtracted experimental diffraction pattern 
of the collapsed phase of Co(bdp) at 25°C; the corresponding Miller indices 
are indicated for the most prominent peaks. For clarity, the three patterns 
have been given an arbitrary y offset; a.u., arbitrary units. Similar anisotropic 
peak broadening, which inflates hkO peaks (but not h00 or 0k0 ones), is 
clearly visible in the experimental diffraction pattern and the paracrystalline 
simulation. The exact full-widths at half-maximum for the experimental and 
simulated Bragg peaks are given in Supplementary Table 16. 
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North Pacific deglacial hypoxic events linked to 


abrupt ocean warming 


S. K. Praetorius!+, A. C. Mix!, M. H. Walczak!, M. D. Wolhowel, J. A. Addison? & FE. G. Prahl! 


Marine sediments from the North Pacific document two episodes 
of expansion and strengthening of the subsurface oxygen minimum 
zone (OMZ) accompanied by seafloor hypoxia during the last 
deglacial transition!*. The mechanisms driving this hypoxia 
remain under debate’“!!. We present a new high-resolution alkenone 
palaeotemperature reconstruction from the Gulf of Alaska that 
reveals two abrupt warming events of 4-5 degrees Celsius at the 
onset of the Bolling and Holocene intervals that coincide with 
sudden shifts to hypoxia at intermediate depths. The presence 
of diatomaceous laminations and hypoxia-tolerant benthic 
foraminiferal species, peaks in redox-sensitive trace metals!”!>, 
and enhanced !°N/"4N ratio of organic matter?°, collectively suggest 
association with high export production. A decrease in 80/!°O 
values of benthic foraminifera accompanying the most severe 
deoxygenation event indicates subsurface warming of up to about 
2 degrees Celsius. We infer that abrupt warming triggered expansion 
of the North Pacific OMZ through reduced oxygen solubility and 
increased marine productivity via physiological effects; following 
initiation of hypoxia, remobilization of iron from hypoxic sediments 
could have provided a positive feedback on ocean deoxygenation 
through increased nutrient utilization and carbon export. Such a 
biogeochemical amplification process implies high sensitivity of 
OMZ expansion to warming. 

Models suggest enhanced ocean deoxygenation in response to future 
global warming, owing to both a reduction in oxygen solubility and 
decreased subsurface ventilation related to thermal stratification'*". 
Uncertainty in these projections reflects weak constraints on the 
response of marine primary productivity to warming! and the extent 
to which ecosystem changes will translate to carbon export and 
remineralization”’, thus altering subsurface oxygen demand. Once 
hypoxia is initiated, seafloor biogeochemical cycling may sustain 
and enhance low-O, conditions via a threshold effect!®. For example, 
reductive mobilization of sedimentary iron may further stimulate 
primary productivity in surface waters of high-nitrate-low-chlorophyll 
(HNLC) regions if it occurs within an ocean depth range susceptible 
to mixing into the euphotic zone; this in turn would increase oxygen 
demand in underlying waters, until sulfidic conditions ensue and limit 
further supplies of dissolved iron from the sediments to the upper water 
column”. 

Data from intermediate water depth marine sediment cores across 
the North Pacific (Fig. 1) document expansion and strengthen- 
ing of the OMZ during the Bolling—Allerod interstade (14.7-12.9 
thousand years ago (ka)) and earliest part of the Holocene intergla- 
cial (11.5-10.5ka)!"*. Mechanisms proposed to account for deglacial 
hypoxia include a decrease in ocean ventilation related to changes in 
ocean circulation*, and increased oxygen demand related to enhanced 
export productivity® 11. 

So far, no consistent evidence links decreased ventilation rate with 
these hypoxic events. Benthic-planktonic radiocarbon age differences 
on the Gulf of Alaska margin show no significant deviation from the 


long-term mean (725 + 200 yr) during either of the hypoxic events 
(Fig. 2), and when benthic-planktonic increases do occur (during the 
Younger Dryas interval; 12.9-11.7ka), they are not associated with 
hypoxia”®. 

Mechanisms to stimulate productivity over such widespread ocean- 
ographic settings remain unclear. However, once hypoxia is initiated 
in a subsurface water mass, it may spread over a greater area than the 
region of elevated productivity®, because downward mixing of oxygen 
beyond the range of shallow wind-driven mixing is small. Iron release 
from continental shelves in response to sea-level rise has been suggested 
as a potential driver of increased productivity in HNLC regions such 
as the subpolar North Pacific!”. However, regional sea-level histories 
differ owing to isostatic responses to ice unloading. For the sea-level 
mechanism to work, long-distance transport of iron would be required. 

Alternative hypothetical mechanisms to stimulate North Pacific 
productivity have invoked increased upwelling of macronutrients dur- 
ing hypoxic events’. This would require reduced stratification, para- 
doxically at a time of northern hemisphere warming and enhanced 
freshwater input to the marginal ocean from melting ice sheets. This 
mechanism does not account for the fact that macronutrients such 
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Figure 1 | Study area and core locations. Top, colour shading on map 
shows modern oxygen concentration at 400 m depth, with the clearly 
defined OMZ across the North, Eastern, and Equatorial Pacific. The core 
site EW0408-85JC in this study is indicated with a red star; other core sites 
that document deglacial hypoxic/productivity events are indicated with 
red circles*. Bottom, meridional cross-section of oxygen concentration in 
the modern Eastern Pacific, with the location of core sites used in the 68O 
depth transect (Fig. 4). Maps were generated with Ocean Data View”. 
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Figure 2 | Data from core EW0408-85JC. a, Planktonic 5'8O data (Nps: 
dark blue, Gb: light blue)’. b, Alkenone palaeotemperature (purple) and 
error estimates (light purple bars). c, Biogenic opal percentages relative 

to bulk sediment (blue)”. d, Relative abundance of low-oxygen benthic 
foraminifera (Bolivina and Bulimina genera: light purple, Bulimina exilis: 
black). e, Redox-sensitive trace metal concentrations (Mo is light green, 

U is dark green) in excess values relative to lithogenic background (dashed 
line)!*!9, £, Sedimentary 5'°N (dark violet)'*. g, Benthic-planktonic 
radiocarbon age differences (black) and benthic projection ages calculated 
with respect to atmospheric AMC (blue)”°. h, Reconstructions of near- 
surface seawater 5!8O based on the planktonic species Gb (green) and 
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Nps (light violet), with depth-based pairs indicated as square symbols 
and values linearly interpolated at 100-yr intervals shown as a trend line. 
Age controls are from Davies-Walczak et al.”° and are indicated with 
blue diamonds at the bottom of the plot. The pink bars represent the two 
laminated intervals in core EW0408-85JC, the grey bar indicates the zone 
in which changes in SST, trace metals, and benthic fauna slightly precede 
the onset of laminations (dashed grey line) (expanded view in Extended 
Data Fig. 7). The timing of major climate intervals are indicated at the 
top of the plot: Holocene (11.6-0 ka), Younger Dryas (YD; 12.9-11.7 ka), 
Bolling—Allerod (BA; 14.6-12.9 ka), late glacial (18-14.7 ka). Nps, 
Neogloboquadrina pachyderma; Gb, Globigerina bulloides. 
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as phosphate and nitrate, supplied today by vertical mixing, do not 
limit productivity in the region. Further, most deep mixing occurs in 
winter, when light is limiting. Although deep upwelling also provides 
some iron, it is not currently enough to consume the macronutrients; 
relief of iron limitation in the past would require an additional iron 
source. Haline stratification due to ice melt may have enhanced marine 
productivity by reducing deep mixing of plankton out of the euphotic 
zone!®, Haline inhibition of deep mixing would, however, reduce the 
source of subsurface iron”!. 

Distinguishing among the various hypotheses requires separating 
sea-surface temperature (SST) and salinity effects on near-surface strat- 
ification and subsurface ventilation. Here we pair a new high- 
resolution palaeotemperature record based on the ses alkenone index 
from the Gulf of Alaska with benthic faunal assemblages and proxies 
for ventilation rate, export productivity, and surface stratification to 
evaluate the sequence of oceanographic changes leading to hypoxia in 
a marine sediment core located in the upper reaches of the modern 
OMZ (EW0408-85]C, 59° 33.32’ N, 144° 9.21/ W, 682 m depth). The 
various proxy indicators are co-registered with little or no impact from 
chronologic error, because they occur in the same sediment core. We 
find that two abrupt deglacial warming events of 4-5 °C coincide with 
increases in nutrient utilization, export productivity, and the sudden 
onset of hypoxia. Alkenone SST reconstructions from other sites in the 
North Pacific show similar trends (Extended Data Fig. 9), indicating 
these were significant regional temperature fluctuations. 

Palaeotemperatures in the northern Gulf of Alaska were lowest 
(~5°C) near 17.0ka, (Fig. 2), coincident with a peak in ice-rafted debris 
(IRD)*. Warming commenced near 16.5 ka, before the Bolling inter- 
stade in Greenland. Warming then accelerated, with a rapid 3-4°C 
rise from 15.2 to 14.7 ka, coincident with the Bolling onset; warm 
conditions persisted until 13.0 ka. 

The high relative abundance of the benthic foraminifera Episto- 
minella pacifica during the late glacial period (17-15 ka) indicates 
that the water column was less oxygenated than modern conditions 
(Extended Data Fig. 4). Following the accelerated warming into the 
Bolling, severe hypoxia seems to have started abruptly at 14.7 ka, as 
documented by a sharp transition to sediment laminations, a shift 
to benthic populations dominated by the low-oxygen foraminifera 
Bulimina exilis, and an increase in sedimentary molybdenum and ura- 
nium concentrations (Fig. 2). This transition coincided with increased 
concentrations of biogenic silica and marine organic carbon and a rise 
in the 5°N of organic matter, reflecting enhanced nutrient utilization 
and export productivity”? (Extended Data Figs 5 and 6). 

The increase in redox-sensitive trace metals (excess Mo and U)!*!3 
and low-oxygen benthic species slightly preceded the increase in 
biogenic silica and total organic carbon (TOC) (by 2-5 cm, about 
300 + 100 yr) during the Bolling—Allerod transition (Extended Data 
Fig. 7). This implies that local hypoxia may have developed before the 
increase in export productivity, most likely through a reduction in oxy- 
gen solubility related to ocean warming. In the absence of bioturbation, 
cm-scale depth offsets in these proxies may in part reflect redox gra- 
dients in the sediment column. However, a deeper site in the Gulf of 
Alaska (EW0408-26JC; 1,620 m) shows benthic faunal evidence for a 
gradual decrease in oxygen, starting with the early warming at 16 ka 
(Extended Data Fig. 8), well in advance of productivity enhancement. 

The benthic faunal assemblages and trace metals indicate that 
hypoxic conditions persisted while surface palaeotemperatures 
remained elevated near 10°C during the Bolling—Allerod inter- 
stade. More oxygenated conditions returned only when alkenone 
palaeotemperatures fell by 4°C during the Younger Dryas interval 
(12.9-11.7ka). Another abrupt 5 °C warming occurred during the 
transition into the early Holocene (11.4-10.9ka); SSTs above 11°C 
were again accompanied by an increase in biogenic silica concentra- 
tions, a rise in 5 N, and a return to hypoxic conditions (evidenced by 
laminations, low-oxygen benthic fauna, and an increase in authigenic 
Mo and U concentrations). 


364 | NATURE | VOL 527 | 19 NOVEMBER 2015 


Benthic faunal assemblage data document crossing of an ecological 
threshold during both episodes of hypoxia, with an abrupt transition 
from virtual absence to dominance of Bulimina exilis at the start of each 
event, indicating that the system switched nearly instantaneously from 
intermediate to severe hypoxia. Shortly following the initial peaks in 
Bulimina exilis, the site was re-colonized by other low-oxygen Bolivina 
species (Fig. 2), indicating a slight relaxation from extreme to strong 
hypoxia. Epistominella pacifica reappears in the faunal assemblages dur- 
ing the Younger Dryas and early Holocene, consistent with the return 
of more oxic conditions in bottom waters as the surface ocean cooled 
(Extended Data Fig. 4). Some low-oxygen benthic fauna and redox- 
sensitive trace metals persisted after decreases in SST and reduction in 
biogenic silica, suggesting a lag in the amelioration of hypoxic condi- 
tions. A trend of increasing relative percentage of Uvigerina peregrina 
during the Holocene indicates the water column became progressively 
well oxygenated. 

The alkenone palaeotemperature data show that most of the 
anomalously low 6'80 at onset of the Bolling interstade (1.1%o of the 
total 1.5%o change) reflects warming (Fig. 2). The temperature and 
ice-volume-corrected seawater 6'°O record is primarily an indicator 
of freshwater input from land, which is typically high in this stormy 
and glaciated region. Freshwater input varies near the onset of the 
Bolling—Allerod interstade hypoxic event, but there are no consistent 
trends that would indicate a sustained freshening of the surface ocean 
during either hypoxic event. This finding precludes haline stratifica- 
tion as a primary cause of hypoxia at this location. Accumulation of 
terrigenous silt is also low during the hypoxic intervals, arguing against 
glacial runoff and rock flour as a primary source of iron to fertilize 
high productivity at these times. However, high rates of terrigenous 
sediment accumulation on the shelf and slope during the late glacial 
period? would have provided a reservoir of iron in the sediments, 
which may then have been available for release in a bioavailable form 
upon initiation of hypoxia. 

Warming slightly precedes both the increase in productivity and 
the onset of hypoxia (rates of warming are 0.6-1.2°C per century in 
the lead up to both events; Extended Data Fig. 3), making it likely 
that temperature exerted a primary initial trigger for biogeochemical 
amplifying effects responsible for extensive hypoxia. This observation 
narrows the hypotheses to two (Fig. 3): either (1) rapid warming led 
to modest regional hypoxia via thermal solubility effects, which in 
turn stimulated marine primary productivity indirectly through the 
mobilization of iron from hypoxic sediments, or (2) rapid warming 
directly stimulated marine productivity, which led to higher con- 
sumption of oxygen in subsurface waters through remineralization 
of organic matter, with possible further amplification via reductive 
iron release. In both hypotheses, warming leads to hypoxia; they are 
not mutually exclusive. Both suggest increases in nutrient utilization 
in what is now an iron-limited system; this is supported by rising 6!°N 
of organic matter’? (Fig. 2) coupled to an increase in 5'°C in plank- 
tonic foraminifera (Extended Data Figs 5 and 6) during the warm 
events. 

The first hypothesis requires warming in subsurface waters. If the 
5'80 excursion (—1%o) in the benthic record were due to an increase 
in temperature alone, it would imply subsurface warming of about 
4-5°C at a depth of 682 m. The presumed implausibility of such 
warming at depth has led to the interpretation of this feature as a 
pulse of low salinity waters”. However, the occurrence of similar but 
smaller benthic 5!8O anomalies (—0.4-0.6%o) in deeper sites during 
the Bolling—Allerod interstade (Fig. 4, Extended Data Fig. 1) is con- 
sistent with up to about 2.0 °C abyssal warming; pulses of low salinity 
are implausible in the deep basin. 

Based on known thermal solubility effects, a 2.0°C warming would 
reduce subsurface oxygen concentrations by about 17 pmol kg~! 
(ref. 22), which if initiated relative to the modern OMZ of the North 
Pacific, would drive the site EW0408-85JC to less than 5 umolkg~', 
enough for significant ecological impacts”? (Fig. 1). The benthic fauna 
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Figure 3 | Schematic diagram of feedback processes linking ocean 
warming to enhanced export productivity. As CO, is vented from the 
deep ocean throughout the deglaciation, the ocean warms along with the 
climate. Warming reduces oxygen solubility and promotes stratification, 
leading to an expansion of the OMZ and a greater area of suboxic 
sediments in the shallow subsurface, which remobilizes bioavailable 

iron to fuel marine productivity, further depleting subsurface oxygen 
concentrations as organic matter is exported and respired at depth. The 
availability of iron and warm, stratified conditions favours large diatoms?’, 
which have high export efficiency owing to high settling velocities. 


suggest slightly lower oxygenation than today before the Bolling- 
Allerod interstade (Extended Data Fig. 4), so subsurface thermal sol- 
ubility effects might have been sufficient to trigger local hypoxia and 
initial vertical expansion of the OMZ, but are unlikely to drive wide- 
spread or severe hypoxia. If shoaling of the hypoxic boundary layer 
was sufficient to trigger an iron and phosphate” release from shallow 
sediments (that is, within reach of mixing into the euphotic zone), an 
ecological response favouring carbon export would have amplified ini- 
tial thermally driven hypoxia. Production of larger diatoms enhances 
carbon export to the subsurface”, and these diatom species are known 
to increase in relative abundance in response to increased iron input”® 
and stratification?”. Dominance of large diatoms in the overall species 
assemblage has been found elsewhere in the Northeast Pacific during 
deglacial warming’’. 

The second hypothesis revives a longstanding debate in biological 
oceanography following Eppley’s inference”® that warming enhances 
phytoplankton growth rate directly through a Qj (exponential physio- 
logical rate) effect. Such an effect may be important in subpolar regions, 
and may be enhanced by stratification that keeps phytoplankton near 
the well-illuminated, warmer sea surface and is classically associated 
with high nutrient uptake and export”. The Eppley hypothesis remains 
controversial. Thermal Qo effects on productivity, although perhaps 
real at high latitudes, are not supported at low latitudes’, making it an 
unlikely mechanism to account for the inferred increase in deglacial 
productivity in lower latitude regions’. 

Subsurface warming would increase remineralization rates of organic 
matter sinking out of the near-surface ocean, and thus biological oxy- 
gen demand in the zone most sensitive to hypoxia. As with the first 
scenario, carbon export to the subsurface ocean, rather than primary 
productivity itself, would be the key variable, suggesting control by 
ecosystem effects that favour large diatoms (such as iron availability), or 
other effects on particle sinking rates. Thus both scenarios imply ther- 
mal triggers, but both require biogeochemical amplification to sustain 
subsurface hypoxia, plausibly through reductive iron remobilization on 
continental margins!”. Such a feedback between ocean warming, OMZ 
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Figure 4 | Deglacial depth transect of 5'%O in the Gulf of Alaska. 
Planktonic foraminiferal data (Nps 5!8O) from core EW0408-85JC is used 
for the subsurface thermocline (depth = 100 m). Benthic foraminiferal 
5'8O data (expressed as Uvigerina peregrina equivalent values) from 
various core sites are plotted at depth: EW0408-85JC (682 m)?, EW0408- 
26JC (1,620 m), and EW0408-87JC (3,680 m) (data plotted in Extended 
Data Fig. 1). The plot is truncated at 2.2 km depth for an expanded view of 
intermediate depths. All data are corrected for the global isotopic effects of 
changing ice volume”’. Labels identify climate intervals specified in Fig. 2. 
The timing of the deglacial hypoxic events are indicated with pink bars. 
Details of the age models of individual cores are included in Methods. 
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expansion, and marine productivity may also provide a plausible mech- 
anism to explain links between interstadial warm periods and abrupt 
transitions to hypoxia observed throughout the North Pacific?*+”"', 
without the need to invoke changes in the Atlantic meridional over- 
turning circulation and associated multi-century time lags of nutricline 
adjustment within the global ocean®. 

Projected future warming of the subpolar North Pacific will prob- 
ably exceed the temperatures associated with past hypoxic events by 
the mid-twenty-first century”, at sustained rates comparable to those 
preceding the deglacial hypoxic events. If enhanced biological produc- 
tivity amplifies future deoxygenation as our evidence suggests it has 
in the past, substantial expansion of subsurface hypoxia beyond that 
predicted solely from thermal solubility effects may occur. While severe 
hypoxia would be catastrophic in the near-term for marine ecosystems 
and fisheries'®, the resulting reduction of carbon remineralization rates 
and enhanced burial of organic matter associated with hypoxia 
may also provide a long-term negative feedback on rising CO2 and 
greenhouse-driven warming, as may have occurred during the deglacial 
hypoxic events* (Extended Data Fig. 9). 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Sediment cores. Marine sediment core EW0408-85JC is located on the continen- 
tal slope of the Gulf of Alaska (59° 33.32’ N, 144° 9.21’ W, 682m). In the modern 
setting, this site lies near the upper margin of the OMZ, where oxygen concen- 
trations are ~20 umolkg”! (Fig. 1). The Gulf of Alaska margin experiences high 
seasonal productivity during the spring and late summer months, with the highest 
chlorophyll levels observed along the northern margin”, near the site of EW0408- 
85JC. The North Pacific drift feeds into the cyclonic Alaskan gyre and Alaskan 
Coastal Current (ACC), which drives downwelling along the margin*. Details 
and photographs of the sediment lithology for this core are previously published’. 

Several sediment cores from various depths from the Northeast Pacific were also 
employed for the compilation of an oxygen isotope depth transect. Site EW0408- 
26JC/TC lies along the continental slope off the southeast Alaska margin (56° 
96! N, 136° 43’ W, 1,623 m), near the lower boundary of the OMZ (Fig. 1). Site 
EW0408-87JC is located under the Gulf of Alaska subpolar gyre, at a depth that 
lies underneath the modern OMZ (58° 77’ N, 144° 50’ W, 3,680 m). 

Age models. The age model for core EW0408-85JC is based on 36 radiocarbon 
dates of mixed planktonic foraminifera” calendar corrected using the Bayesian 
radiocarbon chronology program BChron* with the Marine13 database*, 
assuming a marine reservoir correction of 880 + 80 yr. The age models for cores 
EW0408-26JC/TC consist of 10 radiocarbon dates on mixed planktonic foraminif- 
era, calibrated with Calib 7.0 using a marine reservoir correction of 735 + 50 yr’. 
The age model for the trigger core of site EW0408-26 is poorly constrained due 
to low sedimentation rates, bioturbation, and carbonate dissolution in the upper 
sediments, therefore two tie points to the oxygen isotope stratigraphy of a nearby 
core with excellent age controls (EW0408-66JC) were used for the Holocene/YD 
boundary, and a modern age (0 yr BP) is assigned to the top of the core. The 
age model for core EW0408-87JC is based on 18 mixed planktonic radiocarbon 
dates calibrated with Calib 7.0 using a marine reservoir correction of 850 + 100 yr 
(Extended Data Table 1). One sample was excluded from the age model due to a 
small reversal. The age model for ODP Site 1019! (Extended Data Fig. 1) is updated 
with the age model from Lopes et al.'’. 

Ventilation estimates. Benthic radiocarbon ages were measured on mixed species 
of benthic foraminifera in the same samples as planktonic measurements to give 
an estimate of ventilation changes between the surface and deep waters at site 
EW0408-85JC”. Larger benthic-planktonic age differences reflect an increase in 
reservoir ages of subsurface waters, which may reflect either reduced ventilation 
rate, decreased preformed radiocarbon in the water mass, or mixing with an older 
water mass. Benthic ventilation ages were also evaluated using the projection age 
method”, which accounts for changes in the atmospheric “C history based on the 
preformed radiocarbon content of surface waters before subduction, but does not 
account for subsurface mixing of multiple water masses. 

Oxygen isotopes. Calculated 5'°O of seawater (5!8O,,) combines planktonic 5'°O 
of Nps or Gb? with alkenone palaeotemperatures (this paper) using a Gb calibra- 
tion equation*’, followed by correction for the global isotopic effect of changing 
ice volume’. 

The depth transect of 5'O includes Nps 5!8O data from core EW0408-85JC? as 

representative of near-surface conditions. It includes benthic 5'*O representative of 
subsurface conditions in cores EW0408-85JC (682 m depth)? and EW0408-26JC/ 
TC (1,623 m depth) from Uvigerina peregrina (Uvp), and in core EW0408-87JC 
(3,680 m depth) from Cibicidoides wuellerstorfi (+-0.64 %o). All were corrected for 
global ice volume’, but not for temperature. 
Alkenone palaeotemperature estimates. Total lipids were extracted from ~5 g of 
freeze-dried sediment as per Walinsky et al.*°. Linear, alkenone-containing frac- 
tions were isolated via urea adduction"! and analysed using capillary gas chroma- 
tography with flame ionization detection. Analytical uncertainties in the 
quantification of C37 ketone abundance (K37:2, K37:3 and K37:4) include both a 
mean potential ‘baseline contaminant’ component (positive) and an assumed 5% 
ad in the integrated area (random). Minimum and maximum estimates 
of Uz, {=(K37:2)/(K37:2 + K37:3)}” were determined from the uncertainty in 
K37:2 and K37:3 concentrations. The corresponding uncertainties in temperature 
estimates are large in samples in the deepest part of the core (corresponding in 
time to 17.5-17.0ka) due to extremely low K37 concentrations, most likely related 
high sedimentation rates. Core-top alkenone palaeotemperatures are biased 
towards summer, and thus are representative of temperatures experienced by the 
photosynthetic source of these biomarkers’. 

Various studies have shown that different K37 compounds can be aerated 
selectively“. Consequently, the sea-surface water temperature proxy, U4 , can be 
diagenetically biased to some extent. Most lines of evidence indicate preferential 
degradation of the more unsaturated K37:3. As a result, ux’ values, if altered, 
typically are shifted positively, thereby depicting apparently warmer values. The 
magnitude of the documented diagenetic warming bias appeared to be ~1°C or 
less under the extreme conditions of alkenone degradation experienced in the 
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aerobic burn-down phenomena documented by turbidite records from the 
Madeira Abyssal Plain*®. 

Bacteria capable of both selective and non-selective aerobic degradation of alk- 
enones have been studied in the laboratory*°. In cases where selective alkenone 
degradation led to a positive shift in U35 values, epoxide derivatives were measured 
as intermediate products of the Process. An empirical calibration of the apparent 
diagenetic ‘warming’ effect on ae values caused as a function of these epoxide 
intermediates has now been defined’. 

Alkenones and corresponding epoxide intermediates were measured in modern 
surface sediments collected by multi-core throughout our Southeast Alaska study 
area’, Interpretation of the results using the laboratory-defined empirical calibra- 
tion suggested U45 -based SST estimates were biased too warm by as much as 
1-2.5°C*. The perceived warming effect shows a weak correlation (r= +0.26) 
to the water depth at which each multi-core sample was collected, hinting that the 
magnitude of the warm bias is directly proportional to the availability of dissolved 
oxygen. Therefore, it is unlikely that estimated increases in SST during the hypoxic 
intervals in the Bolling—Allerod interstade (BA) and Holocene are related to 
changes in the sedimentary diagenetic environment. Observations of similar 
warming events during the BA and early Holocene from the California margin” 
(Extended Data Fig. 9) provide further support that these SST reconstructions 
reflect regional climate trends rather than local diagenetic imprints. Furthermore, 
the higher abundances of Epistominella pacifica relative to Bolivina and Bulimina 
species during cooling episodes in the SST reconstruction, such as the late glacial, 
Younger Dryas, and early to mid-Holocene (11-6 ka), suggest more oxygenated 
conditions during cool intervals. These trends are opposite of what would be 
expected if there was an influence of oxidation on the alkenone SST record, sug- 
gesting that a diagenetic warming effect, if present, would most likely act to 
dampen the observed magnitude of SST changes. 

SST was also estimated via the Ux temperature index {=(K37:2 — K37:4)/ 
(K37:2 a+ K37:3 + K37:4)} as calibrated by Prahl et al.* (Extended Data Fig. 2). 
The US, index has been suggested to be a more reliable proxy for SST at tempera- 
tures below 8°C*8“°, The Uj; index results in SST estimates that are ~4°C 
colder during the late glacial period and ~1 °C warmer during the Holocene period 
relative to the us SST estimates, reflecting the influence of changes in the con- 
centration of tetraunsaturated K37:4, which tends to have higher abundances 
during glacial times relative to interglacial times (Prahl et al.*° and present study). 
However, the two SST estimates for the hypoxic intervals are virtually identical, 
providing further support that temperature estimates for these intervals are not 
influenced by preferential degradation of more unsaturated compounds. 
Additionally, the concentration of alkenones are highest during the hypoxic inter- 
vals (Extended Data Fig. 2), which not only increases the signal to noise ratio of 
our analyses (decreasing the associated SST errors), but is also consistent with 
excellent preservation of organic biomarkers during these events. 

Average rates of SST change were calculated in a 400-yr window from the 
palaeotemperature record after interpolating on a 200-yr time step (Extended 
data Fig. 3) 

Benthic faunal abundances. Benthic species abundances were counted from the 
>150 1m size fraction with sample splits that ranged from 25 - 400 benthic spec- 
imens (Extended data Fig. 4). Individual specimens were classified into 12 genera 
and species categories, and the percent abundance for each species was calculated 
relative to the total number of benthic species counted. Bulimina and Bolivina are 
both elongate infaunal benthic genera that are considered indicators of low-oxygen 
conditions, and sometimes associated with high export productivity” Lt Bulimina 
exilis is indicative of the most severe hypoxic/anoxic conditions®> . Epistominella 
pacifica is an epifaunal species that can tolerate intermediate to strong hypoxia 
and are often found in the upper and lower boundary zones of OMZs™. Uvigerina 
peregrina can tolerate intermediate to weak hypoxia, but is typically absent from the 
core of the OMZ. Therefore the relative abundances of these species reflect changes 
in bottom water oxygen concentrations, with high abundances of Uvigerina 
peregrina reflecting relatively well-oxygenated conditions, Epistominella pacifica 
reflecting intermediate hypoxia, Bolivina genera reflecting strong hypoxia, and 
Bulimina exilis reflecting strongly hypoxic to anoxic conditions. 

Trace metal data. Trace metal concentration data and methods are previously 
published!*, Total metal concentration data (Mery) of Mo and U was converted 
to excess (xs) values relative to lithogenic background, using Al concentration data 
in the core and the relationship: Mex, = Mey — (Me/Al jiithogenic X Al using average 
continental crust values of Mo/Al=0.19 x 10-4 and U/Al=0.35 x 10~‘ (ref. 19). 
Both raw concentration data and excess concentrations of Mo and U show similar 
trends, with the greatest enrichments of U and Mo during the hypoxic intervals. 
Organic data. Biogenic silica and total organic carbon data and methods are 
previously published’. The 6!°N data and methods are previously published, 
along with a detailed discussion of various influences that may contribute to the 
bulk 5'5N signature!*. A “corrected” marine 5'°N record was also calculated by 
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correcting for the 6!°N component imparted from terrestrial organic matter? 
(Extended Data Fig. 5). This record shows similar overall trends to the raw 6°N 
record, with enriched 5'°N during the BA and early Holocene hypoxic events. 
Elevated !°N during the hypoxic intervals is consistent with an increase in nutri- 
ent utilization rate, which in this iron-limited setting would likely require an 
elevated iron source. 

The 5!°N data alone do not prove nutrient utilization; an alternate interpretation 
is that the high 5'°N events during the deglacial transition are an advected signal 
from low latitudes, where a stronger oxygen minimum zone resulted in water 
column denitrification®®. The viability of the undercurrent transport hypothesis 
is supported by modern tracer distributions that show California Undercurrent 
water detectable as far north as Alaska®”. Addison et al.'3 discounted the undercur- 
rent transport hypothesis as an explanation for deglacial increases in 6!°N, simply 
because the fraction of undercurrent water reaching Alaska today is small (<15% 
of the subsurface water mass). Implausibly high variations in the tropical source 
waters (>12 %o) would be required to explain the ~2%o 6'°N changes in the Gulf of 
Alaska; such large changes have not been observed in the eastern tropical Pacific™*. 
There is no dynamical mechanism to explain large increases in net pole-ward 
transport relative to mixing with ambient waters along the flow path that could 
yield such high-amplitude 6'°N changes in the Gulf of Alaska. 

Additional data argues in favour of a nutrient utilization mechanism to explain 
the high 6°N events in the Gulf of Alaska. Planktonic foraminiferal 5'°C rises dur- 
ing the Bolling—Allerod and early Holocene warming events, sympathetic with high 
515N events (Extended Data Figs 5 and 6). If the 5'°N were explained entirely by 
increased northward advection of high-nutrient, low-oxygen undercurrent waters 
from the tropics, the warm events should be accompanied by anomalously low 63°C 
of dissolved inorganic carbon. The same conflict appears for explanations of high 
productivity by upwelling of deep nutrient rich waters at these times”; upwelling 
of nutrient-rich waters without an increase in fractional nutrient utilization would 
yield low 6°C (Extended Data Fig. 6). 

A regime change occurs in the early Holocene, from a deglacial interval in which 
5 N is positively correlated with 5'°C (17-9ka, 1 =0.51) to a Holocene system 
in which 5'°N and 5'3C are weakly negatively correlated (Extended Data Fig. 6). 
Consideration of changes in atmospheric 5'3C (ref. 59) and air-sea equilibrium 
of the 6'°C of carbonate ion as a function of temperature™ strengthens this rela- 
tionship (Extended Data Fig. 6). The co-occurrence of high 6'°C with high 6!°N 
during the deglacial interval points to nutrient utilization rate and carbon export 
as a key driver of the hypoxic events. 

This view of nutrient utilization, likely related to removal of iron limitation dur- 

ing the deglacial interval is not inconsistent with the general view of advection of 
low-oxygen waters northward in the California Undercurrent. Indeed, northward 
advection of such low oxygen waters near the shelf-slope break would provide a 
potential reductive source of iron and phosphate‘ from sediments, and would be 
part ofa self-sustaining feedback in the northeast Pacific in which initial hypoxia 
would be sustained and strengthened by iron-fuelled export productivity. Iron can 
be transported relatively long distances in the subsurface ocean in the colloidal 
(essentially non-sinking) fraction®’. Such a mechanism is consistent with sea-level 
rise onto the continental shelves as a possible iron source’, and addresses concerns 
that isostatic rebound puts the local sea-level record in the northern Gulf of Alaska 
out of synch with the hypoxic events’. 
Leads/lags in proxy data. The collection of multiple proxies within the same core 
allows for a precise examination of the timing of redox changes relative to changes 
in oceanographic conditions and export productivity (Extended Data Fig. 7). The 
transition to laminated sediments during the BA occurs with a sharp sedimento- 
logical boundary at 681 cm core depth”. The laminations are closely associated with 
high weight percentages of biogenic opal, consistent with high export of diatoms. 
The laminated intervals are also clearly defined in the X-ray computed tomography 
(CT) grey scan data, which largely tracks sediment density as a function of the 
biogenic to lithogenic fraction, with low values indicating times of high biogenic 
input’. The initial increase in SST precedes the increase in opal and the onset of 
laminations during the BA hypoxic event; temperatures > 10°C are associated with 
the interval of high diatom abundances. The increase in U and Mo occurs slightly 
before (2-5 cm) the increases in biogenic opal, TOC, and the decrease in CT grey 
scale. This may reflect a progression towards low oxygen conditions before the 
increase in export productivity. It is possible that such small depth offsets between 
the increase in trace metals and organic matter concentration may reflect preserved 
redox gradients in the sediment column, and thus not truly represent offsets in the 
time domain. However, all these proxies are emplaced within the bioturbed mixed 
layer and within a few cm of the seafloor during laminated intervals”, so depth 
offsets between proxies should be minor. 

There are no discernible leads or lags between the increase in SST and export 
productivity for the Holocene hypoxic event. However, in this interval, the increase 
in U and Mo appear to slightly lag the increase in biogenic silica and the transition 


to low-oxygen benthic fauna. The Holocene laminated interval is not as precisely 
defined as the onset of the BA laminations, partly due to weaker (slightly mottled) 
laminations, making the evaluation of depth offsets in the proxy data less reliable 
than for the BA sequence. 

The switch from oxic to hypoxic/anoxic-tolerant benthic fauna occurs 
abruptly near the onset of laminations for both the BA and Holocene events. 
However, low-oxygen benthic species dominate the faunal assemblages well after 
the termination of laminations and the decrease in biogenic silica. Similar trends 
can be seen in the trace metal data. This concordance may indicate that low- 
oxygen conditions persisted even after the decline in export productivity and 
the cessation of laminations. This is consistent with a hysteresis-like response 
in the benthos, with an abrupt threshold transition to a hypoxic regime, fol- 
lowed by a more gradual return to an oxic regime with a diversity of benthic 
fauna®. Some upward mixing of older hypoxic fauna by bioturbation could have 
occurred when oxic conditions returned, however, this is unlikely to account 
for the high abundances (60 - 80%) of low-oxygen fauna that persisted in these 
intervals. 

Based on this sequence of events, it appears most probable that sea surface 
warming lead to a reduction of dissolved oxygen in the subsurface through the 
combined effects of reduced oxygen solubility and enhanced thermal stratification. 
Benthic fauna assemblages from a deeper site in the Gulf of Alaska (near the lower 
boundary of the OMZ) suggest a gradual progression towards hypoxia starting 
at 16 ka, followed by an abrupt onset of laminations at the transition into the BA 
(Extended Data Fig. 8). The initial reduction of dissolved oxygen in the subsurface 
would have intensified the OMZ, leading to an expanded area of hypoxic shelf 
sediments (Fig. 3). A shoaling of the upper boundary of the OMZ to ~300 m during 
the BA hypoxic event has been documented in benthic faunal assemblages from 
the California Borderland basins™. 

The supply of sedimentary iron is thought to be most efficient in a redox win- 

dow where neither oxygen nor sulfide is present’’, making such “new hypoxic 
zones’ prime candidates for the release of bioavailable iron, especially in the shal- 
lower depths where iron can be more easily upwelled to the surface, as occurs in 
offshore regions of the Gulf of Alaska. Additional sources of iron include freshwater 
runoff charged with glacial rock flour. Such supplies of iron could have helped 
to fuel primary productivity, enhanced export productivity, and further depleted 
subsurface ocean concentrations, leading to a threshold-like feedback effect to 
amplify ocean deoxygenation. 
Feedbacks on the carbon cycle. The timing of the North Pacific hypoxic events 
approximately coincided with two intervals of abrupt increase in atmospheric N»O 
(ref. 65) and a cessation in the rise of atmospheric CO, (refs 66, 67) (Extended Data 
Fig. 9). The widespread expansion of hypoxic zones in the North Pacific could have 
led to denitrification, contributing to the two abrupt increases in atmospheric NO, 
while the enhanced export flux and burial of organic carbon may have helped to 
stabilize the rise in atmospheric CO, (ref. 4). Thus, the temperature evolution 
of the North Pacific could play a prominent role in the regulation of multiple 
greenhouse gases. 

Initial deglacial warming could promote out-gassing of deep-ocean respired 
carbon. Continuous and/or abrupt warming could have pushed large areas of 
the North Pacific across thresholds of hypoxia, in which the cycling of nutrients 
and carbon was fundamentally altered, contributing to expansive denitrification 
and the release of NO gases. However, the development of widespread hypoxia 
may ultimately act as a negative feedback on rising CO; and global warming, 
with the release of nutrients from hypoxic sediments acting to stimulate surface 
productivity (in particular, diatoms with high efficiency for carbon export””) and 
the decrease in water column oxygen concentration helping to promote carbon 
burial (Fig. 3). Thus, thresholds of hypoxia in the North Pacific linked to ocean 
warming have the potential to switch this region from a source to sink of carbon. 
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the deglacial period, including the two hypoxic intervals and the Younger and/or excellent preservation of organic matter during these events. 
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Extended Data Figure 3 | Rate of SST change in the Gulf of Alaska. The alkenone palaeotemperature record (purple) was interpolated on a 200-yr time 
step and the average rate of temperature change (°C per century) was calculated over a 400-yr window (black). 
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Extended Data Figure 6 | Comparison of planktonic 6'°C with 
sedimentary 5°N in core EW0408-85JC. a, 5'°C of planktonic 
foraminifera, Nps (blue), Gb (red), average of the two species (black). 

b, Comparison of the average planktonic foraminiferal 5'°C (black) with 
changes in 5'°C of atmospheric CO) (blue)*’ and estimated surface ocean 
813C of CO3=. (red, calculated from the smooth atmospheric values using 
the temperature relationship of Zhang et al.™ c, Relationship between 
§°N in organic matter and 54°C in planktonic foraminifera (average of Gb 
and Nps). The 6!°N and 5!°C measurements were in most cases made in 
adjacent samples, typically separated by 1 cm. To prevent directional bias 
in the scatter plot, the two variables were first interpolated linearly onto 
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the depth of the other variable. In no cases were interpolations allowed 
over an interval >5 cm or 200 years. The positive correlation between 
organic 5'°N and planktonic foraminiferal 6'°C during the deglacial 
interval (17-9 ka, red points, r? = 0.51) supports an interpretation of 
increased nutrient utilization and carbon export from near-surface waters 
associated with the high 6'°N events. In contrast, within Holocene time 
(9-0ka) the relationship between 5'°N and 5'3C reverses, suggesting no 
systematic variations in nutrient utilization. d, Time series of organic 6°N 
(organic) and 613C (average of the planktonic foraminifera Gb and Nps) as 
a time series; these data form the comparison in c. 
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with high diatom abundance and SSTs near or exceeding 10°C, whereas 
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Extended Data Figure 8 | Surface climate proxies compared with changes —_ low-oxygen benthic fauna is apparent in the deeper site (EW0408-26JC) 


Bolivina & Bulimina (% benthics) 


in benthic 580 and fauna from different depth sites in the Gulf of commencing at 16 ka, which coincides with the pre-Bolling warming in the 
Alaska. The alkenone palaeotemperature record from core EW0408-85JC SST record and an increase in the planktonic and benthic 6'8O records. This 
(purple), a composite record of planktonic 5'8O from cores EW0408- initial decrease (blue shading) in sedimentary oxygen content at the base of 
26JC and EW0408-66JC (black)**, benthic 5'8O (dark blue) and biogenic the OMZ clearly precedes the large increase in biogenic silica and the shift 
silica (bright blue) from core EW0408-85JC?, the combined abundance of to hypoxic conditions in core EW0408-85JC near the onset of sedimentary 
low-oxygen tolerant Bolivina and Bulimina species from cores EW0408- laminations (pink shading). Sediment laminations in core EW0408-26JC 
85JC (682 m; green) and EW0408-26JC (1,620 m; violet). An increase in occur from 15-14 ka (shaded bar on x-axis). 
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Extended Data Figure 9 | Northeast Pacific SSTs, productivity indices, light green), total nitrogen (TN: dark green), and 5’°N records on bulk 
and atmospheric greenhouse gases. Data from top: CO) record from EDC organic matter from core EW0408-85JC'’. Grey shaded bars indicate the 
(dark blue)®* and WAIS (light blue)*”, a record of N2O from TALOS Dome two intervals in which the deglacial rise in CO, plateaus/reverses, which 
(bright blue)®, the Gulf of Alaska uw SST record (purple), a us SST record generally correspond to the episodes of widespread North Pacific hypoxia, 
from the California margin (black)*”, records of total organic carbon (TOC: high SSTs, enhanced nitrate utilization, and increased export productivity. 
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Extended Data Table 1 | Radiocarbon age controls for core EW0408-87JC 
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Radiocarbon measurements were made on mixed planktonic foraminifera, sinistral Neogloboquadrina pachyderma (Nps) and Globigerina bulloides (Gb) picked from the >150 um size fraction. 
Radiocarbon samples were analysed at the Keck AMS facility at U.C. Irvine. Radiocarbon dates were calibrated with Calib 7.0 using the Marine13 calibration curve and a marine reservoir correction 


of 850+ 100 yr. The 2-c midpoint age, rounded to the nearest decade, is used for the age model. One sample was excluded from the age model due to a minor age reversal (denoted with an asterisk), 
although this date is within uncertainty of the adjacent shallower radiocarbon date, indicating very high sedimentation rates in this section of the core. 


© 2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


doi:10.1038/nature15509 


The effects of life history and sexual selection on 
male and female plumage colouration 


James Dale!, Cody J. Dey*, Kaspar Delhey*+, Bart Kempenaers® & Mihai Valcu® 


Classical sexual selection theory!* provides a well-supported 
conceptual framework for understanding the evolution and 
signalling function of male ornaments. It predicts that males 
obtain greater fitness benefits than females through multiple 
mating because sperm are cheaper to produce than eggs. Sexual 
selection should therefore lead to the evolution of male-biased 
secondary sexual characters. However, females of many species are 
also highly ornamented*’. The view that this is due to a correlated 
genetic response to selection on males!* was widely accepted 
as an explanation for female ornamentation for over 100 years® 
and current theoretical®’° and empirical!" evidence suggests 
that genetic constraints can limit sex-specific trait evolution. 
Alternatively, female ornamentation can be the outcome of direct 
selection for signalling needs”. Since few studies have explored 
interspecific patterns of both male and female elaboration, our 
understanding of the evolution of animal ornamentation remains 
incomplete, especially over broad taxonomic scales. Here we use 
a new method to quantify plumage colour of all ~6,000 species 
of passerine birds to determine the main evolutionary drivers of 
ornamental colouration in both sexes. We found that conspecific 
male and female colour elaboration are strongly correlated, 
suggesting that evolutionary changes in one sex are constrained 
by changes in the other sex. Both sexes are more ornamented in 
larger species and in species living in tropical environments. 
Ornamentation in females (but not males) is increased in 
cooperative breeders—species in which female-female competition 
for reproductive opportunities and other resources related to 
breeding may be high®. Finally, strong sexual selection on males has 
antagonistic effects, causing an increase in male colouration but a 
considerably more pronounced reduction in female ornamentation. 
Our results indicate that although there may be genetic constraints 
to sexually independent colour evolution, both female and male 
ornamentation are strongly and often differentially related to 
morphological, social and life-history variables. 

The extraordinary interspecific variation in bird colouration has 
provided model studies on animal ornamentation (Fig. 1). The many 
striking cases of sexual dichromatism in birds illustrate the power of 
sexual selection because such species often have highly polygynous 
mating systems*, However, other factors besides sexual selection can 
influence colour elaboration'!® and there is growing evidence that 
many female ornaments are adaptive and subject to direct selection®”. 
Females often compete for ecological resources, and may use ornamen- 
tal traits to mediate competitive interactions. Since male ornaments 
are also used during competition for non-sexual resources (for exam- 
ple, food, territories), ornament evolution in both sexes may be better 
understood through the concept of social selection'*, which encom- 
passes both traditional sexual selection, and selection on non-sexual 
interactions”!”. Under this framework, male and female ornamentation 
should correlate with both sexual selection and life-history traits that 


potentially influence the level of competition for resources. To date, 
the life-history traits associated with ornamentation in both males and 
females remain poorly understood. 

A fundamental challenge in resolving these issues is the quan- 
tification of colour ornamentation in a way that allows meaningful 
interspecific comparisons. We developed a method to quantify colour 
elaboration by determining how ‘male-like’ a focal plumage is (Fig. 2). 
This approach, which can be used with any colour quantification tech- 
nique (see Methods and Extended Data Fig. 1), has the important prop- 
erty of quantifying diverse colours using a single metric (that is, how 
male-like it is). Thus, birds with dramatically different appearances 
and of different sex can have similar scores (for example, Fig. la, d). 


Figure 1 | Interspecific variation in avian plumage colouration. 

a-d, Birds display an astonishing variety of colour patterns. The intense 
blue plumage of male Hainan blue flycatchers (Cyornis hainanus) (a) is 
dramatically more colourful than female plumage (b) in this strongly 
sexually dichromatic species. In contrast, pale flycatchers (Bradornis 
pallidus) are sexually monochromatic and both males (c) and females (not 
shown) are drab coloured. Other monochromatic species, however, can be 
highly ornamented, for example, female red warblers (Cardellina ruber) 
(d) have a degree of plumage colouration that rivals male Hainan blue 
flycatchers (plumage scores: 71.3 and 71.7, respectively). The evolutionary 
causes of this diversity are not well understood. Credits: a, S. Kongwithtaya; 
b, M. & P. Wong; c, M. Goodey; d, S. Colenutt. 


Unstitute of Natural & Mathematical Sciences, Massey University, Auckland 0745, New Zealand. @Department of Biology, McMaster University, 1280 Main St. West, Hamilton, Ontario L8S 4K1, 
Canada. °School of Biological Sciences, Monash University, Victoria 3800, Australia. “Max Planck Institute for Ornithology, Am Obstberg 1, 78315 Radolfzell, Germany. Department of Behavioural 
Ecology and Evolutionary Genetics, Max Planck Institute for Ornithology, Eberhard Gwinner Str, 82319 Seewiesen, Germany. 
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score = 41.53 


Figure 2 | The method used for quantifying plumage colouration. 

a-f, For each sex of each species we identified similarly coloured plumage 
patches in all other species and calculated the proportion of those patches 
that are male. We illustrate this for the male red-billed quelea Quelea quelea 
(image from Lynx Edicions used with permission). a, Digitally scanned 
images of all passerine birds from the Handbook of the Birds of the World!® 
(N=5,983) were processed in the R package ‘colorZapper’ (see Methods). 
b, Red, green, blue (RGB) values in three dorsal (nape, crown, forehead) 
and three ventral (throat, upper breast, lower breast) patches were measured 
(RGB values for male upper breast shown). c, For each plumage patch, 
colour values for both sexes in all species were pooled. The panel depicts 
upper breast patch scores visualized in RGB colour space, N= 11,966 points 
(that is, 2 points for each species, point colour is determined from actual 
RGB values). d, e, For each sex in each species the nearest 1% (N= 120) 

of data points (in Euclidian space) were identified (coloured data points) 
(d) and the percentage of males (blue points) was determined and used as a 
‘patch score’ (e). f, The average of the six patch scores was used as the final 
‘plumage score’ for each sex (male and female red-billed quelea illustrated). 
Plumage scores thus reflect how ‘male-like’ or ‘female-like’ a plumage 

is, but are determined independently of the sex of each focal data point. 
There is a high correlation between scores determined as described here 
versus analogous scores determined with ultraviolet-to-visible reflectance 
spectra from museum specimens (Extended Data Fig. 1). The overall 
patterns reported in this paper are highly robust to different cut-offs used to 
calculate plumage scores (Extended Data Fig. 3). 


We quantified male and female plumage colouration in all passerine 
birds (Order: Passeriformes) illustrated in the Handbook of the Birds of 
the World'®. Passerines represent the most derived and largest avian radi- 
ation (V=5,983 species, 61% of all birds). Figure 3a illustrates the basic 
patterns of colour variation in this group: in many species males are more 
colourful than females, but there are also many sexually monochromatic 
species with extensive variation in colour elaboration (Extended Data 
Fig. 2 shows the colours and associated scores for different patches). 

Across the passerines, male and female plumage scores were highly 
correlated (Fig. 3a, phylogenetically controlled reduced major axis 
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Figure 3 | Plumage scores and plumage dichromatism in relation to 

key predictors in passerine birds. a, Male plumage score versus female 
plumage score (N= 5,983 species). b-f, Contour maps depicting the average 
of the (scaled) predictor values overlaid within the 99.8% volume contour of 
the male versus female plumage score distribution. Note that the strength of 
the relationship between predictor values and plumage colouration varies 
between plots (as reflected by varying ranges in the contour legends). Values 
within the plots were calculated by superimposing a 300 x 300 grid over the 
scatter occurring between 30 and 80 and then calculating at each grid point 
the mean predictor value of the closest 3% of species. 


regression, R?=0.299, T=5.96, P< 0.0001). Multivariate Ornstein- 
Uhlenbeck evolutionary models indicate that plumage colour evolution 
is subject to cross-sex constraints that partially restrict independent 
trait evolution (Extended Data Table 1). Cross-sex constraints, however, 
do not imply that ornamentation cannot also be directly selected for 
signalling needs’”. 

To investigate colour diversity among species, we first tested for 
correlations between sexual dichromatism and ten predictor varia- 
bles. Sexual dichromatism decreased with body mass and wing length, 
increased with latitude, more seasonal environments and clutch size, 
increased with social polygyny, sexual size dimorphism and female- 
only parental care, decreased with cooperative breeding and increased 
with migratory behaviour (Extended Data Table 2a). 

Because many of the ten predictor variables are strongly intercorre- 
lated, we next consolidated variation to five main predictors (Extended 
Data Table 3): (1) species with large ‘body size’ values were heavier and 
had longer wings; (2) species with high ‘tropical life history’ values were 
more likely to breed in the tropics, inhabit areas with year-long envi- 
ronmental stability and lay small clutches; (3) species with high ‘sexual 
selection scores tended to be socially polygynous, show male-biased 
sexual size dimorphism and lack paternal care; (4) ‘cooperative breed- 
ing’ was defined as present or absent; and (5) ‘migration as no, partial 
or complete migration between breeding and non-breeding ranges. 
Multiple predictor models (Extended Data Table 2b), which control 
for the confounding effects of the predictors, showed sexual selection 
to be the strongest predictor of dichromatism (followed by migration 
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Figure 4 | Coefficient estimates and model lines of linear mixed models 
predicting plumage scores in 2,471 species of passerines. a, Coefficients 
plot of the effect sizes for each of the five key predictor variables (scaled) 
and their respective interactions with sex. The centre point denotes the 
mean, the thick bar denotes the posterior standard deviations (akin to 

the standard errors) and the thin bars denote the 95% lower and upper 
confidence limits as calculated by model-averaging 100 separate Markov 
chain Monte Carlo generalized linear mixed models using the package 
MCMCglmm” in R and 100 different phylogenetic trees from 
http://birdtree.org*’. b-f, Model predictions of the effect sizes keeping the 
effects of other predictors constant (males, blue; females, pink). Solid black 
lines denote the mean effect size of the 100 separate MCMCglmm runs and 
coloured areas illustrate the full range of regression line estimates 

across all models. 


and body size) while tropical life history did not have a significant 
effect. These results are difficult to interpret, however, because dichro- 
matism can vary through evolutionary changes in males, females or 
both sexes!’. Therefore, to resolve the evolutionary drivers associated 
with colour elaboration itself, we analysed male and female plumage 
phenotypes concurrently. 

Phylogenetically informed linear mixed models” on complete data 
from 2,471 species (Fig. 4 and Extended Data Table 4) showed that 
the five predictor variables explained more interspecific variation in 
female than in male colouration. Female colouration was elaborated 
(more male-like) in larger species, in species with tropical life histories 
and in cooperative breeders. In males, larger species and species with 
tropical life histories also had more colour-elaborated plumage, but the 
effect of body size was smaller (Fig. 4b). As expected, in species with 
strong male-biased sexual selection, male colours were significantly 
more elaborated. However, sexual selection had a much larger and 
opposite effect on females: in species with strong male-biased sexual 
selection, females had greatly reduced colour elaboration (Fig. 4d). 
Plumage dichromatism therefore increased in species with strong 
sexual selection, as reported in numerous other studies!°, but our 
results show that the principal driver of this pattern is evolutionary 
change in female, not male, colour elaboration. 

Because our predictor variables might interact in complex ways, 
we used phylogenetic path analysis”! to disentangle cause and effect 
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Figure 5 | Relationships among ecological variables and plumage 
colouration, as determined by phylogenetic controlled d separation 
path analysis”!. Arrows indicate direct effects; the strength of the effect 
is indicated with numeric values and by line thickness. Arrow colour 
indicates the direction of the effect (black, positive; red, negative). 


relationships. This supported the analysis described earlier, demon- 
strating that sexual selection has a strong direct negative effect on 
female colouration, and a weaker direct positive effect on male colour- 
ation (Fig. 5). Additionally, female colouration was directly influenced 
by all predictor variables. Models that included a correlated response 
to male colour were generally favoured, but models in which female 
colour was only directly influenced by male colour and not by other 
variables performed poorly (Extended Data Fig. 4 and Extended Data 
Table 5). Thus, our results suggest that female colouration is not merely 
a by-product of strong selection on males, but is an adaptive response 
to various social and life-history factors. 

Three main conclusions follow from our study. First, although body 
size is rarely considered in interspecific studies of animal colouration 
(but see ref. 22), it strongly predicted colour elaboration in both sexes, 
and more strongly in females than in males. Larger species are both 
more colour elaborated and less dichromatic (Fig. 4b), consistent with 
the hypothesis that being larger reduces predation risk”, thereby poten- 
tially weakening selection for crypsis. The result refutes the argument 
that large body size itself is an evolutionary constraint on colouration” 
(at least for passerines). 

Second, both sexes of species with tropical life histories, typified by 
equatorial breeding ranges, low seasonality and small clutch sizes”, 
were more elaborated than temperate breeding species (Fig. 4c), a 
robust verification of the hypothesis that tropical species are more 
colourful’*. The strength of this effect rivals the strength of the effect 
of sexual selection on colouration (at least for males). These patterns 
are consistent with two non-mutually exclusive hypotheses. First, 
colourful plumage functions in mate choice and mutual sexual selection 
is stronger in tropical species; and second, colourful plumage functions 
in an aggressive context and resource competition is stronger in tropi- 
cal species. Indeed, tropical species are thought to be under increased 
competition for breeding vacancies and resources, have longer-term 
pair bonds, more common year-round territoriality and increased con- 
vergence of male and female reproductive roles**. Moreover, there is 
an obvious convergence between visual and vocal signalling in tropical 
birds. Female song is more common in tropical species and experi- 
mental evidence suggests that song in tropical species has dual func- 
tionality as both advertisement for mates as well as an ‘armament’ in 
competitive interactions”®. 

Third, the intensity of sexual selection strongly predicted varia- 
tion in plumage colouration (Fig. 4d): more elaborated males and 
increased sexual dichromatism are found in species with male-biased 
sexual selection. Increased sexual dichromatism, however, was driven 
mainly by a strong negative relationship between the intensity of sexual 
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selection and female colouration. A similar pattern was observed in 
New World blackbirds (Icteridae)”’, but our study demonstrates its 
generality within the passerines. This finding is consistent with two 
hypotheses. First, it can reflect increased sexual selection on females in 
monogamous species, supporting a game-theoretic model that predicts 
stable mutual mate choice in species with extensive parental investment 
in both sexes”*. Second, selection for social signalling may be reduced 
in females of species with strong male-biased sexual selection”’. Indeed, 
ecological factors that favour social polygyny”’ (for example, spa- 
tially clumped resources) may also be associated with reduced social 
competition between females, at least in the absence of paternal care. 
Additionally, high levels of male-biased sexual selection seem to break 
the correlation between male and female ornamentation (Fig. 4d). Here, 
the divergent parental roles of males and females may facilitate the evo- 
lution of sex-specific developmental modifiers (for example, hormones) 
that limit the expression of ornamental plumage in females". 

Traditionally, studies have focused on male colour elaboration, par- 
ticularly in species with extreme sexual selection. This has left much 
of the interspecific variation in colouration unexplained. Our results 
demonstrate clearly that the immense diversity in avian plumage col- 
ouration is the outcome of selection acting additively and often dif- 
ferentially between the sexes (Fig. 3). The patterns presented here for 
the passerines can be tested in other taxa and provide a rich arena for 
future hypothesis testing on the function of ornamentation in males 
and females. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 


Received 21 May; accepted 21 August 2015. 
Published online 4 November 2015. 


1. Darwin, C. The Descent of Man, and Selection in Relation to Sex John Murray, 

1871). 

Bateman, A. J. Intra-sexual selection in Drosophila. Heredity 2, 349-368 (1948). 

Trivers, R. in Sexual Selection and the Descent of Man 1871-1971 (ed. 

Campbell, B.) 136-179 (Aldine, 1972). 

Andersson, M. B. Sexual Selection (Princeton Univ. Press, 1994). 

Amundsen, T. Why are female birds ornamented? Trends Ecol. Evol. 15, 

149-155 (2000). 

Rubenstein, D. R. & Lovette, |. J. Reproductive skew and selection on female 

ornamentation in social species. Nature 462, 786-789 (2009). 

7. Tobias, J. A., Montgomerie, R. & Lyon, B. E. The evolution of female ornaments 
and weaponry: social selection, sexual selection and ecological competition. 
Phil. Trans. R. Soc. B. 367, 2274-2293 (2012). 

8. Lande, R. in Sexual Selection: Testing the Alternatives (eds Bradbury, J. W. & 
Andersson, M.) 83-94 (Wiley, 1987). 

9. Bonduriansky, R. & Chenoweth, S. F. Intralocus sexual conflict. Trends Ecol. Evol. 
24, 280-288 (2009). 

10. Kraaijeveld, K. Reversible trait loss: the genetic architecture of fernale 
ornaments. Annu. Rev. Ecol. Evol. Syst. 45, 159-177 (2014). 

11. Poissant, J., Wilson, A. J. & Coltman, D. W. Sex-specific genetic variance and the 
evolution of sexual dimorphism: a systematic review of cross-sex genetic 
correlations. Evolution 64, 97-107 (2010). 

12. Potti, J. & Canal, D. Heritability and genetic correlation between the sexes in a 
songbird sexual ornament. Heredity 106, 945-954 (2011). 

13. Cardoso, G. C. & Mota, P. G. Evolution of female carotenoid colouration by 
sexual constraint in Carduelis finches. BMC Evol. Biol. 10, 82 (2010). 


ar 


a 


370 | NATURE | VOL 527 | 19 NOVEMBER 2015 


14. West-Eberhard, M. J. Sexual selection, social competition, and speciation. 
Q. Rev. Biol. 58, 155-183 (1983). 

15. Lyon, B. E, Eadie, J. M. & Hamilton, L. D. Parental choice selects for ornamental 
plumage in American coot chicks. Nature 371, 240-243 (1994). 

16. West-Eberhard, M. J. Darwin’s forgotten idea: the social essence of sexual 
selection. Neurosci. Biobehav. Rev. 46, 501-508 (2014). 

17. Lyon, B. E. & Montgomerie, R. Sexual selection is a form of social selection. 
Phil. Trans. R. Soc. B. 367, 2266-2273 (2012). 

18. del Hoyo, J., Elliott, A. & Christie, D. A. Handbook of the Birds of the World 
Vols 8-16 (Lynx Edicions, 2003-2011). 

19. Badyaey, A. V. & Hill, G. E. Avian sexual dichromatism in relation to phylogeny 
and ecology. Annu. Rev. Ecol. Evol. Syst. 34, 27-49 (2003). 

20. Hadfield, J.D. MCMC methods for multi-response generalized linear 
mixed models: the MCMCglmm R package. J. Stat. Softw. 33, 1-22 
(2010). 

21. von Hardenberg, A. & Gonzalez-Voyer, A. Disentangling evolutionary 
cause-effect relationships with phylogenetic confirmatory path analysis. 
Evolution 67, 378-387 (2013). 

22. Galvan, |., Negro, J. J., Rodriguez, A. & Carrascal, L. M. On showy dwarfs and 
sober giants: body size as a constraint for the evolution of bird plumage 
colouration. Acta Ornithol. 48, 65-80 (2013). 

23. Ricklefs, R. E. Insights from comparative analyses of aging in birds and 
mammals. Aging Cell 9, 273-284 (2010). 

24. Stutchbury, B. J. & Morton, E. S. Behavioral Ecology of Tropical Birds 
(Academic, 2001). 

25. Bailey, S. F. Latitudinal gradients in colors and patterns of passerine birds. 
Condor 80, 372-381 (1978). 

26. Tobias, J. A., Gamarra-Toledo, V., Garcia-Olaechea, D., Pulgarin, P.C. & 
Seddon, N. Year-round resource defence and the evolution of male and female 
song in suboscine birds: social armaments are mutual ornaments. 

J. Evol. Biol. 24, 2118-2138 (2011). 

27. Irwin, R. E. The evolution of plumage dichromatism in the New World 
blackbirds: social selection on female brightness. Am. Nat. 144, 890-907 
(1994). 

28. Kokko, H. & Johnstone, R. A. Why is mutual mate choice not the norm? 
Operational sex ratios, sex roles and the evolution of sexually dimorphic 
and monomorphic signalling. Phil. Trans. R. Soc. B. 357, 319-330 
(2002). 

29. Emlen, S. T. & Oring, L. W. Ecology, sexual selection, and the evolution of 
mating systems. Science 197, 215-223 (1977). 

30. Jetz, W., Thomas, G. H., Joy, J. B., Hartmann, K. & Mooers, A. O. 

The global diversity of birds in space and time. Nature 491, 444-448 
(2012). 


Acknowledgements We thank the many ornithologists and scientists who have 
published their data or contributed data to public databases, allowing us to 
conduct this study. Thanks to J. D. Aguirre, P. M. Buston, J. Clavel, P. B. Rainey, 

L. Redfern and J. A. Tobias for comments on manuscript drafts and to the staff 
of Museum Victoria and the Australian National Wildlife Collection for access 

to museum specimens. This work was supported by Massey University and a 
grant from the Australian and Pacific Science Foundation (APSF 10/8) to J.D. 
C.J.D. was supported by a Natural Sciences and Engineering Research Council 
of Canada (NSERC) Canadian Graduate Scholarship. K.D. was supported by the 
Australian Research Council (DE120102323). B.K. and M.V. were generously 
supported by the Max Planck Society. 


Author Contributions Conceived of the study: J.D., M.V. and B.K.; collected the 
data: J.D., M.V., K.D. and C.J.D.; developed the methods: J.D. and M.V.; analysed 
the data: J.D. and C.J.D. with help from M.V. and K.D.; wrote the paper: J.D. and 
CJ.D. with input from the other authors. 


Author Information Data sets have been deposited in the Dryad Digital 
Repository (http://dx.doi.org/10.5061/dryad.1rpOs). Reprints and permissions 
information is available at www.nature.com/reprints. The authors declare no 
competing financial interests. Readers are welcome to comment on the online 
version of the paper. Correspondence and requests for materials should be 
addressed to J.D. (j.dale@massey.ac.nz). 


© 2015 Macmillan Publishers Limited. All rights reserved 


METHODS 


Passerine classification and species list. Plumage scores were calculated on each 
of the 5,983 species of passerines (Order: Passeriformes) included in the Handbook 
of the Birds of the World (volumes 8-16)!*. Phylogenetically controlled statistical 
analyses were restricted to the 5,831 of these species that are also included in the 
avian phylogenies at http://www.birdtree.org™’. No statistical methods were used 
to predetermine sample size. 

Plumage scores. We digitally scanned images of each passerine species from the 
plates in the Handbook of the Birds of the World'* into 300 dpi JPEGs using a Fuji 
Xerox ApeosPort-IV C5575 set to default scan settings. Each species was cropped 
out of the scans and used to measure the RGB (red, green, blue) values for six 
patches (nape, crown, forehead, throat, upper breast, and lower breast) on each 
sex of each species using the R package ‘colorZapper’*!. We scored these regions 
because (1) they are consistently illustrated clearly for each species in the plates 
(rumps are often not shown for instance), and (2) the anterior body region is unar- 
guably a very important signalling region for birds in general. For each plumage 
patch, a polygon was subjectively selected that encompassed the typical colouration 
evident in that area of the bird’s plumage (see Fig. 2a for an example with red- 
billed quelea*’). We excluded any obvious areas of glare added by the plate artists. 
colorZapper then calculated the mean values for R, G and B (on scales of 0 to 255) 
for 400 randomly chosen pixels within the selected polygon. In cases where multi- 
ple subspecies were illustrated, we scored colouration in the nominate subspecies. 

Males and females are usually illustrated with the same image in the Handbook 
of the Birds of the World'* when they are sexually monochromatic (N= 3,822 spe- 
cies). In these cases, the same image was measured independently twice, once to 
obtain male RGB values and once to obtain female RGB values. This was required 
to maintain similar measuring error between both dichromatic and monochro- 
matic species. We confirmed that within-species measuring error was consistent 
between species scored with two different images versus species scored with the 
same image twice. In 63 of the 2,161 species where we used separate images for 
males and female colour measurements, females differed from males in ways other 
than the colour of the six plumage patches used in this study (for example, they 
had shorter tails or a different coloured iris). The correlation between male and 
female plumage scores from this sample (R* = 0.905, N= 63) was similar to the 
correlation between male and female plumage scores measured separately from 
the same image (R?=0.913, N= 3,822). 

Sometimes, when there is only a small difference between male and female 
plumages, the same image is used to illustrate both sexes of a species and the dif- 
ference between the sexes is described in the text. In 372 species (6.22% of total) 
we noted a described difference in the text that was not illustrated in the plates. 
Many of these descriptions noted that females were “similar to males” but “duller” 
(N=65), “slightly duller” (N= 44), “paler” (N= 46), or “slightly paler” (N= 29). 
Most of the rest of the described differences related to specific colour differences 
(for example, females described as “browner’, “more rufous’, “less blue’, and so 
on) or to specific colour patches (for example, “coronal patch absent or small”). 
To estimate the magnitude of the measurement error associated with this issue, 
we identified 100 species for which the described difference between the sexes 
was similar to those described earlier (that is, “duller’, “slightly duller’, “paler’, or 
“slightly paler”) but for which we scored male and female colouration from separate 
images. In these species, the average difference between male and female plumage 
scores was 3.82 (N= 100 species, P< 0.001). 

Because these described differences occur in a small percentage of species and 
reflect relatively small differences between the sexes, we expected them only neg- 
ligibly to affect our general results. To confirm that our main results are robust to 
the error generated by these species, we repeated the main MCMCglmm analysis in 
two different ways: (1) with these species removed entirely from the data set (note 
that these species affected 168 of the 2,471 species (6.8%) in the MCMCglmm); 
and (2) with these species’ plumage scores adjusted by subtracting the mean male- 
female difference calculated as described earlier (that is, 3.82) from the female 
scores. Both of these analyses yielded essentially identical results to the analysis 
reported in the paper (the R? between the effects of these analyses and the effects 
reported in Extended Data Table 4 was 0.998 and 0.999, respectively). 

Scoring colouration with handbook plates represents a valid alternative to meas- 
uring colour on all the world’s passerines using museum specimens and/or live 
individuals. Indeed, the objective of handbook plates is to reflect as accurately as 
possible the typical colouration and patterning of a species such that the image is a 
suitable reference for field identification. As such, special care is taken to reproduce 
colouration accurately and the plates in the Handbooks of the Birds of the World 
are highly regarded as superb in quality and consistency (for example, see ref. 33). 
Moreover, previous studies have found that colouration in plates is highly corre- 
lated with colouration in museum specimens (as measured by spectrometry"“), 
and several previous comparative studies have used colouration in plates to test 
hypotheses about avian colour evolution (for example, see refs 35-38). Finally, 
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we validated that plumage scores generated with handbook plates were highly 
correlated with plumage scores generated with ultraviolet-to-visible (UV-Vis) 
spectrometry (see later and Extended Data Fig. 1). 

Plumage scores validation analysis. We validated that plumage scores are con- 
sistent between different colour-measuring methodologies. We used UV-Vis 
spectrometry to measure the reflectance of 8 plumage patches (upper back, 
dorsal neck, crown, forehead, throat, ventral neck, upper breast and lower breast) 
of up to 3 male and 3 female museum specimens for 534 species of Australian 
terrestrial birds (229 non-passerines and 305 passerines). Museum specimens 
were obtained from Melbourne Museum (Museum Victoria) and the Australian 
National Wildlife Collection (CSIRO). Reflectance spectra were collected using a 
spectrometer (Avaspec 2048, Avantes) connected to a xenon pulsed light source 
(Avalight-XE) through a fibre optics cable fitted at the end with a plastic cylinder 
to exclude ambient light and standardize measuring distance. Reflectance spectra 
were expressed relative to a WS-2 white standard using Avasoft software. For each 
species we calculated mean reflectance spectra per patch separately for males and 
females. To compute plumage scores we pooled together all male and female mean 
spectra for each patch (N= 1,068 spectra for each patch). We matched each spec- 
trum in the pool to the 120 most similar spectra as determined by minimizing the 
squared-differences between spectra at 5nm wavelength intervals, summed across 
300-700 nm. We then quantified the percentage of those 120 ‘closest matches’ 
that corresponded to male spectra. For each species, male and female UV-Vis 
plumage scores were calculated as the mean ‘percentage male’ values across all of 
the eight patches measured in each sex of each species. For these same 534 species 
we also quantified patch colour using handbook plates digitally scanned from 
those published previously!* and then scoring them in RGB colour space using 
the R-package colorZapper”! (see earlier and Fig. 2). The 120 closest matching 
colour patches to any focal patch were determined by minimizing the Euclidian 
distance in RGB space, and then the proportion males in those closest matches 
was quantified. Note that this method of quantifying colouration is sample-size 
dependent: larger samples of species will provide more accurate measures of how 
‘male-like any kind of colour is. Nevertheless, despite having only 534 species in 
this sample (contrasting with 5,983 species in the main analysis), both ‘male-like’ 
indices were strongly positively correlated (R’=0.67, P< 0.0001; Extended Data 
Fig. 1a), indicating that using handbook plates to estimate bird colours is a suitable 
alternative to the use of reflectance spectrometry. 

To demonstrate further that plumage scores calculated with handbook plate 
measurements are suitably interchangeable with plumage scores calculated with 
UV-Vis spectra, we repeated statistical analyses presented in the paper with the 
subset of passerine species for which spectral data were collected. If handbook 
plates provide a suitable surrogate measure then conclusions drawn from spectral 
data on the 305 species should be similar to conclusions drawn from the same 
305 species based on plate data. First, we compared effect size estimates for single 
predictor PGLS models run on each of the ten predictor traits in Extended Data 
Table 2a. We ran models predicting female scores, male scores and dichroma- 
tism scores (N= 30 effects in total). Our analysis showed a very strong correla- 
tion between RGB effect sizes versus UV-Vis effect sizes (Extended Data Fig. 1b). 
Second, we compared effect size estimates (including sex—predictor interactions) 
for MCMCglmm models run with our five main predictors in Fig. 4. For each 
plumage score type we ran five models using a different phylogenetic tree™ in each 
model. This analysis showed a very strong correlation between RGB effect sizes 
versus UV-Vis effect sizes (Extended Data Fig. 1c). In combination, these analyses 
strongly indicate that the biological patterns reported in the main analysis would 
not have been different if we had used plumage scores based on UV-Vis spectra. 
These results thus provide critical validation of our method because although 
human and avian vision have considerable overlap!” birds can also see UV light 
not visible to humans”°. 

Predictor variables. Body size (mass and wing length) was tested because it is a 
known confound of sexual size dimorphism“! and life-history traits”. In addi- 
tion, based on a previous study”? we predicted a negative relationship between 
body size and colour elaboration. Tropical life history (latitude, seasonality and 
clutch size) was tested to evaluate the prediction that characteristics associated 
with tropical breeding—in particular, density-dependent or ‘K-selected’* factors 
such as increased competition for limited breeding vacancies*4—were associated 
with increased plumage colour elaboration. Sexual selection (social mating system, 
sexual size dimorphism, and paternal care) was tested because the prevailing view 
in the literature is that sexual selection on males is the fundamental evolutionary 
driver of male colour elaboration and sexual dimorphism. However, recent studies 
on New World blackbirds?”*“5 (Icteridae), tanagers*® (Thraupidae) and fairy- 
wrens” (Maluridae) have demonstrated rapid evolutionary transitions in female 
colouration (also see ref. 48). Therefore we also expected sexual selection on males 
to potentially have a strong effect on female colouration. Cooperative breeding 
species were predicted to be more colour elaborated because of increased social 
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competition among group members and this effect was predicted to be stronger 
in females than in males’. Migration was tested to evaluate two opposing pre- 
dictions. On the one hand, it has been argued” that species with long-distance 
migration should be less colourful as a result of increased predation associated 
with long-distance movements (also see ref. 51). In contrast, other researchers!” 
have argued that migratory species should be more colourful because migration 
imposes a short mate sampling period and enhances selection on signals related 
to migration ability. 

Body mass data was taken from that described previously**. When more than 
one body mass entry was available for a species, we computed the mean weighted 
by sample size. Log-transformed values were used in the statistical analysis. 

Wing length data was taken from that described previously*". Up to 7 different 
sets (average = 2.5) of wing measurements were recorded per species. Each set 
comprised the means (or mid-ranges when only a range was provided) for both 
males and females measured in a single population, and we took the means of these 
for final species values. Log-transformed values were used in the statistical analysis. 

For latitude data, species’ geographical location was computed based on the 
breeding range maps of all species. First, breeding range polygons were trans- 
formed from an unprojected coordinate system (latitude, longitude) to an equal- 
area projected coordinate system (cylindrical equal-area, latitude of the origin: 0°). 
Species’ geographical location was then computed as the latitude (degrees from 
equator) of the breeding range centroid. 

For seasonality, Moderate Resolution Imaging Spectroradiometer (MODIS) 
land surface temperature rasters (code name MOD11C2) were obtained through 
the http://reverb.echo.nasa.gov gateway at a 0.05° spatial resolution and an 8-day 
temporal resolution (time span: 2000-2012)°. All the raster files were then super- 
posed and a temperature time series was obtained for each pixel. A coefficient of 
variation (CV%) of temperature was computed for each pixel™, For each spe- 
cies the CV% for all pixels within its breeding range were extracted. A species 
‘seasonality’ score was defined as the median CV% score within its breeding 
range (log transformed). 

Clutch size was compiled from standard references, in particular'*°*. Up to 
five different reports of clutch sizes were recorded for each species, with the mean 
of these taken as the final clutch size of a species. Clutch size was recorded as the 
mean clutch size or, when only range data was provided, as the mid-range value. 
Log-transformed values were used in the statistical analysis. 

Sexual size dimorphism was calculated as log(male wing length) — log(female 
wing length)*", providing a proportional index of relative sizes of the sexes. Positive 
values reflect species where males are larger than females. 

Social polygyny was scored on a four-point scale*, with 0 = strict social monog- 
amy (for example, zebra finch Taeniopygia guttata), 1 = monogamy with infre- 
quent instances of polygyny observed (<5% of males, for example, lazuli bunting 
Passerina amoena), 2= mostly social monogamy with regular occurrences of facul- 
tative social polygyny (5 to 20% of males, for example, American redstart Setophaga 
ruticilla), and 3 = obligate resource defence polygyny (>20% of males, for exam- 
ple, red-winged blackbird Agelaius phoeniceus) or lek polygyny (for example, 
lance-tailed manakin Chiroxiphia lanceolata). Assignments were made based on 
standard references, in particular!®4!°*®, There are a small number of passerine 
species with polygynandrous mating systems (for example, the dunnock Prunella 
modularis, Smith’s longspur Calcarius pictus and sickle-billed vanga Falculea 
palliata) and these species were pooled with the monogamous species. We reasoned 
that sexual selection would be more similar in each sex in polygynandrous species 
compared with polygynous species, and our social polygyny scores were intended 
to specifically quantify male-biased sexual selection. 

Paternal care was scored as absent (0) or present (1) primarily based on the 
data provided in ref. 63, including both the known and inferred data categories. 
For species in our data set not present in ref. 63, we used standard references, in 
particular ref. 18, to obtain the additional parental care scores. 

Cooperative breeding was scored as absent (0), suspected (0.5), or present (1) 
primarily based on the data provided in ref. 63, including both the known and 
inferred data categories. For species in our data set not present in ref. 63, we used 
standard references, in particular ref. 18, to obtain the additional cooperative 
breeding scores. 

Migration was scored on a scale from 0 to 2, with 0 = resident (that is, breeding 
and non-breeding ranges identical), 1 = partial migration (that is, some overlap 
between breeding and non-breeding ranges), 2= complete migration (that is, no 
overlap between breeding and non-breeding ranges). Assignments were made 
based on the range maps published previously'®. The migratory behaviour of one 
species, the Red Sea Swallow Hirundo perdita, is unknown. 

Statistical analysis. Because closely related species tend to be more similar to each 
other than distantly related species, we used phylogenetically informed compara- 
tive analyses to account for potential non-independence among species owing to 
common ancestry. More specifically, the error structure of the statistical model 


incorporates the degree of non-independence between species as estimated from 
the phylogeny. Phylogenetically informed methods are unlike ordinary statistical 
models (where data points are assumed to be independent) because they explicitly 
model how the covariance between species declines as they become more distantly 
related®*, We used the Hackett®” backbone phylogenetic trees available at http:// 
birdtree.org” to estimate phylogenetic separation in our statistical models. 

The relationships between the ten predictor variables listed earlier and sexual 
dichromatism (Extended Data Table 2) were modelled with phylogenetic general- 
ized least-squares! ©*°* (PGLS) using the R-packages ‘ape’ and ‘nlme’””. 

To calculate phylogenetic reduced major axis regression”! and the phylogenetic 
principal components” used in the multiple predictor PGLS, MCMCglmm and 
paths analysis we used the R-package ‘phytools’”. 

All data were analysed in R” version 3.1.0. To improve the interpretability of 
regression coefficients”, predictor variables were centred and standardized to a 
mean =0 and standard deviation = 1. 

Phylogenetically informed generalized linear mixed models. Monte Carlo 
Markov chain generalized linear mixed models (Fig. 4 and Extended Data Table 4) 
were generated with the R-package ‘MCMCglmm”°’. MCMCglmm is a Markov 
chain Monte Carlo sampler for multivariate mixed models that enables the inclu- 
sion of a phylogeny as a design matrix in a Bayesian generalized linear modelling 
framework. The design matrix for phylogenetic effects was based on 100 trees from 
http://birdtree.org*®. We then used MCMCglmm to fit male and female plumage 
scores as the response and the five predictor variables as continuous predictors. 
Phylogenetic effects were considered a random effect, sex was fit as a dummy var- 
iable and species was fit as an observation level random effect. We used the prior: 
[list(R =list(V = 1, nu = 0.002), G=list(G1 =list(V = 1, nu=0.002)))] and model 
outcomes were insensitive to prior parameterization. We let the MCMC algorithm 
run for 10,000,000 iterations, with a burn in period of 3,000 and a sampling interval 
of 10,000. Each model generated ~1,000 independent samples of model parame- 
ters (Extended Data Table 4). Independency of samples in the Markov chain was 
assessed by graphic diagnostics and testing for autocorrelation between samples. 

Multivariate Ornstein-Uhlenbeck evolutionary models. Ornstein-Uhlenbeck 
(OU) models are powerful tools for analysing the evolution of traits, and are well 
suited to studies of correlated evolution’®, These models are a generalization of 
Brownian motion (BM) models”, in that they incorporate stochastic trait evo- 
lution (defined by a drift parameter o), but also allow trait values to be attracted 
towards optima, at a rate that is dependent on the value of « (the strength of selec- 
tion towards the optima). When a=0, the process reduces to Brownian motion; 
however, many patterns of trait evolution are better fit by OU models, because 
trait evolution is often subject to stabilizing selection, at least over evolutionary 
time scales’*. Additionally, the development of multivariate OU (mvOU) models 
has allowed trait evolution to be influenced by interactions with other trait values. 
Indeed, trait evolution is often thought of as a multivariate process, and mvOU 
models allow researchers to explore the influence of co-adaptive or limiting forces 
on trait evolution. 

We used mvOU models to test whether the evolution of plumage ornamenta- 
tion was subject to cross-sex constraints. Using the mv MORPH” package in R, 
we constructed five evolutionary models of the potential relationship between 
male and female ornamentation (using the plumage scores described in the main 
text (V=5,831 species)) and compared their fit with AIC’®. In model 1, male 
and female ornamentation evolved completely independently (that is, both the 
a and o matrix were restricted to be diagonal) under OU processes, simulating 
no evolutionary relationship between male and female plumage colour. In model 
2, male and female plumage scores evolved independently towards their optima 
(a diagonal « matrix); however, we allowed covariation in the stochastic element 
(that is, drift) of the OU process. This model might reflect a situation where male 
and female ornamentation respond similarly to some environmental parameter, 
but are not directly constrained by one another. In model 3, the evolution of male 
and female plumage ornamentation was influenced by an interaction with the 
other sex’s trait value (a symmetric positive « matrix), and we allowed covariance 
in the stochastic element of trait evolution. Such a model represents cross-sex con- 
straints in plumage colour evolution. Models 4 and 5 were BM models including 
independent and covarying drift matrices, respectively. 

Model 3 was the clear best model (Extended Data Table 1a), demonstrating that 
male and female plumage colour do not evolve independently. Positive off-diagonal 
elements in the « matrix of this model (Extended Data Table 1b) suggest that male 
and female plumage values are pulled towards one another, consistent with cross- 
sex constraints against independent trait evolution, and the correlated response 
hypothesis. Models of fully independent trait evolution performed poorly (model 
1 and 4), and models including covariation in drift performed better than those 
with independent drift. This suggests that in addition to an interaction in selection 
on male and female plumage colour, the sexes generally have a correlated response 
to ecological conditions. 
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Phylogenetic path analysis. Confirmatory path analysis is a special case of struc- 
tural equation modelling used to build models of causal relationships among a 
suite of variables and test how the data conform with those causal models***". 
When conducted in a phylogenetic framework”!*”, this approach can infer the 
most likely evolutionary pathways by simultaneously considering both indirect 
and direct effects among variables, as well as the magnitude of these effects. This 
technique is advantageous in that it better reflects the true nature of evolutionary 
processes because factors do interact in complex ways (for example, by acting as 
both causal parents and causal children) to create evolutionary outcomes (that is, 
phenotypes). Here, we used the d-separation?!*°*? method to test for the most 
likely relationships among seven variables related to plumage colouration. The 
variables used were the five principal components derived from the phylogenetic 
principle components analysis (Extended Data Table 3), as well as male plumage 
score and female plumage score. First, we built 14 biologically relevant models of 
the relationships among these variables (Extended Data Fig. 4). These models are 
constructed as directed acyclic graphs, which are required for this type of analysis. 
Model construction was based on the hypothesized relationships among variables 
that had been suggested in previous studies (for example, see refs 19, 27, 28) and 
from exploratory analysis of correlations among variables. Each model is then 
converted to a set of conditional independencies of the form (X1, X2) {X3}, where 
variables X1 and X2 are independent conditional on variable X3 (see elsewhere®”* 
for details). We then tested the set of conditional independencies in each model 
using phylogenetic generalized least squares (GLS) models (using the nlme” pack- 
age in R“) anda single tree that was randomly selected from the tree pool at http:// 
birdtree.org*®. Using these GLS models, we calculated Fisher's C-statistic (which is 
equivalent to a maximum-likelihood estimate*’) and the C-statistic information 
criterion (CICc) for each conceptual model*”*?, 

Comparison of models based on CICc values indicated that model K was the 
best model (Extended Data Table 5). Model L appears to also be a competitive 
model, however, visual comparison of models K and L show that they suggest the 
same causal relationships among variables, except model L contains one additional 
causal link (between life history and sexual selection). The addition of a single, 
uninformative variable often causes marginal changes in CIC and can therefore 
appear to create competitive models, although in such a case the simpler model 
should be preferred®*°, Models including a direct effect of male colour on female 
colour performed better than those that did not include this effect (G, I) and those 
in which there was a direct effect of female colour on male colour (M, N). However, 
a model in which female colour was only directly influenced by male colour and 
not by other variables (C) also performed poorly. This suggests that female coloura- 
tion is not simply a genetically correlated response to selection on male colouration, 
and is instead affected by other variables. The best model (K) generally supports 
the results of the MCMCglmm analysis (Fig. 4 and Extended Data Table 4). 
This model (Fig. 5) suggests that body size, cooperative breeding, migration, 
tropical life history, sexual selection and male colour all directly influence female 
colouration, while only sexual selection, tropical life history and body size directly 
influence male colouration. The magnitude and direction of the standardized 
regression coefficients (Fig. 5) suggest that there is a strong role of social selection 
on female colouration. Females become more colourful with increasing body size, 
more tropical life histories and cooperative breeding, factors that are predicted 
to increase the intensity of competition (see main text). Also consistent with our 
MCMCglmm analysis, the path analysis shows that sexual selection has direct, 
but antagonistic effects on both male and female colouration. Interestingly, the 
negative effect of sexual selection on female colouration is larger than the positive 
effect on male colouration. This pattern supports the finding that sexual selection 
has an overall negative effect on passerine colouration. 

Code availability. Scripts of analyses and code used for figure production are 
available upon request from J.D. 
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Extended Data Figure 1 | Comparison of plumage scores determined model had 305 Australian passerine species in it and black, red and 
with handbook plates in RGB colour space with plumage scores green points reflect models predicting female scores, male scores and 
determined with study skins using UV-Vis spectrometry. a, RGB versus — dichromatism scores, respectively). c, MCMCglmm model effect sizes 
UV-Vis scores calculated with 534 Australian bird species (reduced (including interaction effects) determined with RGB versus UV-Vis 
major axis (RMA) regression: y = 0.965x + 2.025, N= 1068, R?=0.670, scores (RMA regression: y = 0.991x — 0.028, R?=0.930, P< 0.0001, N=11 
P<0.0001). b, PGLS model effect sizes determined with RGB scores effects, the model had 305 Australian passerine species in it and each point 
versus effect sizes determined with UV-Vis scores (RMA regression: reflects the mean effect size calculated from five models using a separate 
y =0.878x — 0.042, N= 30 model effects, R* =0.809, P< 0.0001, each phylogenetic tree from http://birdtree.org each). 
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Extended Data Figure 2 | Basic patterns of plumage colouration in the 
order Passeriformes (N= 5,983 species). a, Male versus female patch 
scores (upper breast shown as an example). Data points are coloured 

by the male RGB values scored from handbook plates. b, As in a, only 
points are coloured with female RGB scores. c-h, The colours associated 
with different plumage scores differentiated by patch type. The y axis is 


30 40 50 60 70 80 


Lower-breast score 


a normally distributed random number used to spread out variation and 
improve visualization. The figure reflects patterns that agree with our 
intuition: dull greens, olives and browns have low plumage scores 

(are female-like), while richer or high contrasting colours (blacks, purples, 
blues, reds and yellows) have high scores (are male-like). 
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Extended Data Figure 3 | MCMCglmm results are robust to various 
cut-offs used to determine plumage scores. a, b, Main effects (a) and 
interaction (b) with sex effects. Filled circles: estimates where P < 0.001; 
open circles: estimates where 0.05 > P > 0.001. The vertical line represents 
the cut-off value used in the main analysis. To reduce processing time 

for this analysis each MCMCglmm model ran for 10,000 iterations with 

a sampling interval of 100. Parameter estimates from the shortened runs 


were highly similar to the parameter estimates reported in the main 
MCMCglmm analysis (see Extended Data Table 4). Note that the observed 
decline in effect sizes as the cut-off size increases is the automatic outcome 
of how plumage scores are calculated. As the cut-off size approaches 100% 
the variance in plumage scores necessarily approaches 0 and so the effect 
sizes will inevitably also approach 0. 
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Extended Data Figure 4 | Candidate models for phylogenetic confirmatory path analysis. a~n, Arrows indicate hypothesized direct links between 


variables. 
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Extended Data Table 1 | Comparison of multivariate co-evolutionary 
models of male and female plumage 


ornamentation 
a 

Model a o Peas 7, AL AAIC 
3 correlated correlated -39068 78153 - 
2 independent correlated -39101 78217 64 
1 independent independent -40140 80292 2139 
5 - correlated -43481 86973 8820 
4 - independent -45156 90321 12168 


Parameter Estimate 
8 female = 47.52 
male = 51.02 
2.30 0.90 
a 
0.90 1.22 


290.55 216.50 


216.50 222.17 


a, Multivariate Ornstein-Uhlenbeck (OU) models are ordered by Akaike information criterion (AIC) 
value from best (top) to worst. The a matrix determines the strength of selection and can 
incorporate an interaction among traits (‘correlated’) or can model independent selection on each 
trait (‘independent’). The o matrix determines the amount of variation (drift) in the stochastic 
evolutionary process, which can be independent or correlated among traits. Brownian motion 
models (4 and 5) do not contain a parameters. b, Parameter estimates for the best model (Model 3 
in a). 6 values are the estimated trait optima towards which male and female plumage colour evolve 
under OU processes. The a and o matrices are shown for female-specific values in the top left cell, 
male-specific values in the bottom right cell and the interaction between trait values in the 
off-diagonal elements. 
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Extended Data Table 2 | Phylogenetic generalized least-squares models on predictors of sexual dichromatism in the Passeriformes 


Variable N species B Std.Error t-value P 

a, Single predictor models Body Mass 5066 -0.76 OA -4.39 < 0.0001 
Wing Length 2855 -0.66 0.25 -2.66 0.011 
Degrees from equator 5831 0.60 0.09 6.77 < 0.0001 
Seasonality 5831 0.64 0.09 7.02 < 0.0001 
Clutch size 3909 0.51 0.13 4.01 < 0.0001 
Social polygyny 3504 1.51 0.15 9.83 < 0.0001 
Sexual size dimorphism 2855 0.70 0.15 4.59 < 0.0001 
Paternal care 5831 -0.99 0.12 -8.29 < 0.0001 
Cooperative Breeding 5831 -0.25 0.09 -2.77 0.007 
Migration 5830 0.81 0.09 9.37 < 0.0001 

b, Multiple predictor model Body size 2471 -0.95 0.25 -3.70 0.0002 
Tropical Life History 2471 -0.28 0.20 -1.40 0.16 
Sexual Selection 2471 1.76 0.19 9.20 <0.0001 
Cooperative Breeding 2471 -0.28 0.14 -2.01 0.04 
Migration 2471 0.81 0.19 4.33 <0.0001 


To account for phylogenetic non-independence, models included a phylogenetic correlation structure using phylogenetic trees from http://birdtree.org?°. Provided are the model-averaged predictions 
of 20 models (each using a different tree), with sexual dichromatism (that is, male plumage score — female plumage score) as the dependent variable. The phylogenetic signal, estimated as Pagel’s A 
coefficient, was A= 0.80 (mean; range: 0.76-0.84). Predictor variables were scaled to a mean of O and standard deviation of 1. 
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Extended Data Table 3 | Phylogenetic principal component analysis loadings and the variance explained by each component 


Component Loadings 


Predictor Variable PPC1 PPC2 PPC3 
Body size Body mass 0.48 -0.13 
Wing length 0.48 0.13 
% Variance 93.15 6.85 
Tropical life history Clutch size -0.39 0.62 0 
Degrees from equator -0.46 -0.27 -0.3 
Seasonality -0.46 -0.26 0.3 
% Variance 76.59 17.32 6.09 
Sexual selection Sexual size dimorphism -0.37 0.85 -0.01 
Social polygyny -0.57 -0.27 0.51 
Paternal care 0.57 0.28 0.5 
% Variance 53.81 29.15 17.05 


Provided are the mean values obtained from phylogenetic principal component analysis of 20 separate phylogenetic trees from http://birdtree.org. The first components were used in the main 
analysis. PPC, phylogenetic principal component. 
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Extended Data Table 4 | Morphological, life-history and social correlates of plumage colour scores in passerine birds (N= 2,471 species) 


Predictor 

(Intercept) 

sex 

body size 

life history 

sexual selection 
cooperative breeding 
migration 

sex : body size 

sex : life history 

sex : sexual selection 
sex : cooperative breeding 


sex : migration 


Shown are posterior estimates of the effect size in the full model (plus the 95% credibility intervals, the effective sample size (N), and the probability that the effect is different from the null hypothesis). 
The full model was the deviance information criterion (DIC) favoured model out of all possible additive models (N= 243 models) that include sex as well as all possible combinations of the five 
predictor variables and their respective interactions with sex. Values are means of 100 separate runs on each of 100 separate phylogenetic trees from http://birdtree.org. 


Posterior mean 
45.91 
4.24 
2.60 
0.91 
-1.95 
0.40 
-0.35 
-1.14 
0.29 
2.70 
-0.50 


0.70 


Lower 95% C.1. 
40.64 
3.95 
2.16 
0.53 
-2.30 
0.13 
-0.72 
-1.44 
-0.09 
2.41 
-0.80 


0.32 


Upper 95% C.1. 
51.33 
4.53 
3.06 
1.30 
-1.61 
0.68 
0.01 
-0.86 
0.67 
3.00 
-0.21 


1.07 
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N 
987 
1018 
1023 
1024 
1009 
1045 
1015 
1024 
1016 
1007 
1018 
1015 


pMCMC 
< 0.001 
< 0.001 
< 0.001 
< 0.001 
< 0.001 
0.005 
0.063 
< 0.001 
0.14 
< 0.001 
0.0015 


0.0011 


Extended Data Table 5 | Comparison of models used in phylogenetic 
confirmatory path analysis 


Model 
K 


Oo 7 F OU WDTtT me fF 2 fF 


G 


Cc 
28.81 
27.90 
43.04 
42.13 
169.00 
184.74 
189.26 

207.37 
244.17 
277.68 
304.60 
377.06 
790.13 
1054.12 


ClCc 
67.12 
68.25 
81.35 
82.47 
201.23 
214.93 
215.41 
235.54 
274.37 
303.83 
330.75 
401.19 
820.32 
1078.24 


ACICc 
0 
1.13 
14.23 
15.35 
134.11 
147.81 
148.29 
168.42 
207.25 
236.71 
263.63 
334.07 
753.20 
1011.12 


Models are ordered by ClCc value from best (top) to worst. C indicates Fisher's C-statistic. 
For a visual illustration of the models see Extended Data Fig. 4. 
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Migratory neuronal progenitors arise from the 
neural plate borders in tunicates 


Alberto Stolfi!, Kerrianne Ryan’, Ian A. Meinertzhagen? & Lionel Christiaen! 


The neural crest is an evolutionary novelty that fostered the 
emergence of vertebrate anatomical innovations such as the 
cranium and jaws'. During embryonic development, multipotent 
neural crest cells are specified at the lateral borders of the neural 
plate before delaminating, migrating and differentiating into 
various cell types. In invertebrate chordates (cephalochordates 
and tunicates), neural plate border cells express conserved factors 
such as Msx, Snail and Pax3/7 and generate melanin-containing 
pigment cells‘, a derivative of the neural crest in vertebrates. 
However, invertebrate neural plate border cells have not been shown 
to generate homologues of other neural crest derivatives. Thus, 
proposed models of neural crest evolution postulate vertebrate- 
specific elaborations on an ancestral neural plate border program, 
through acquisition of migratory capabilities and the potential 
to generate several cell types°-’. Here we show that a particular 
neuronal cell type in the tadpole larva of the tunicate Ciona 
intestinalis, the bipolar tail neuron, shares a set of features with 
neural-crest-derived spinal ganglia neurons in vertebrates. Bipolar 
tail neuron precursors derive from caudal neural plate border cells, 
delaminate and migrate along the paraxial mesoderm on either side 
of the neural tube, eventually differentiating into afferent neurons 
that form synaptic contacts with both epidermal sensory cells and 
motor neurons. We propose that the neural plate borders of the 
chordate ancestor already produced migratory peripheral neurons 
and pigment cells, and that the neural crest evolved through the 
acquisition of a multipotent progenitor regulatory state upstream 
of multiple, pre-existing neural plate border cell differentiation 
programs. 

Progenitor cells that fulfil all the criteria defining the neural crest 
have not been observed outside vertebrates. These criteria include an 
embryonic origin at the lateral borders of the neural plate, epithelium- 
to-mesenchyme transition (EMT), migratory behaviour and the 
potential to differentiate into diverse cell types such as neurons, bone, 
cartilage and pigment cells. 

In cephalochordates (amphioxus) and the tunicates Halocynthia 
and Ciona, a subset of neural plate border cells deploy a conserved 
melanocyte-specific gene network but do not migrate away from 
the neural tube”. Instead, they contribute locally to pigmented 
photoreceptor organs. In Ciona, the pigment cell precursors undergo 
an epithelial-to-mesenchymal transition and remain inside the neural 
tube lumen, but can be induced to exit the neural tube through targeted 
mis-expression of the mesenchyme-specific transcription factor Twist- 
related’. Migratory pigment cell precursors have also been reported in 
larvae of the tunicate Ecteinascidia turbinata®. 

In contrast, invertebrate homologues of neural-crest-derived neurons 
have so far proved elusive. In tunicates, various neurons arise from the 
neural plate borders, but these remain in the dorsal neural tube or in 
the epidermis”"®, instead of delaminating and migrating as would be 
expected for homologues of vertebrate neural-crest-derived neurons. 
Migratory sensory neurons have been described in cephalochordate 


embryos, but these arise from ventral epidermis, not the neural plate 
borders, and reinsert into the epidermis after migrating''. 

The recently identified bipolar tail neurons (BI'Ns)” of Ciona larvae 
form axon fascicles that extend along the length of the tail on either side 
of the neural tube (Fig. 1a). These neurons express the proneural basic 
helix-loop-helix transcription factor Neurogenin (Neurog, Fig. 1b) and 
the LIM-homeodomain factor Islet (Fig. 1a). Vertebrate Neurogenin 
and Islet orthologues are involved in specifying various neuronal 
subtypes including neural-crest-derived dorsal root ganglia neurons 
(DRGNs), which also have a bipolar or pseudo-unipolar morphology 
and transmit peripheral mechanosensory inputs to the central nervous 
system!%. Ciona BTNs also express Asic, the orthologue of acid-sensing 
ion channels (ASICs)'* that modulate touch sensitivity in vertebrate 
DRGNs. These parallels prompted us to investigate the embryological 
origins of the BTNs. 


Ciona intestinalis larva — 


Proximal process Distal process 


BT precursors 


@ Neural plate 
@ Neural plate borders 
@ Pigment cell lineage 
@ BTN lineage 


@ Ingressing mesoderm 


Gastrula stage (st. 12) 


Neurula stage (st. 15) 


Figure 1 | Bipolar tail neurons come from the borders of the neural plate. 
a, Larva with a BTN labelled by Islet BIN >unc-76::eGFP (green). Bottom, 
enlarged view of BTN above. Scale bars, 75 um (top); 25 um (bottom). 

b, Migrating BTN precursors (arrowheads) labelled by the b-line-specific 
Neurog b-line >unc-76:: Venus reporter construct (green). Scale bar, 25 um. 

c, In situ hybridization for Neurog (magenta) in an embryo electroporated 
with Msx >nls::lacZ plasmid (immunolabelling of B-galactosidase in green). 
White arrowhead, Msx*/Neurog* BTN progenitor. Dashed arrowhead, 
transient Neurog expression in BTN progenitor’s sister cell (epidermal 
progenitor). Dashed line, midline. Scale bar, 25 um. d, In situ hybridization 
for Neurog (red) and Snail (green). Scale bar, 25 jum. Inset is enlarged box 
showing low levels of Snail expression in BTN progenitor. e, Pax3/7 in situ 
hybridization (green). Scale bar, 25 um. Enlarged box inset showing Pax3/7 
expression in BTN progenitor. f, Adapted illustration!” of embryos showing 
position of pigment cell and BTN progenitors (and their descendants) in the 
neural plate borders. Lateral views in a, b, dorsal views in c-f. Anterior to the 
left throughout; st., stage. 
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e 
pBTN 
St. 21 — 
f Neurog 


Nodal b8.18 


Figure 2 | Bipolar tail neuron precursors delaminate and migrate. 

a, Embryos electroporated with Neurog b-line >unc-76::eGFP. Top, BTN 
precursor (arrowhead) extending a lamellipodium. Bottom, BTN precursors 
(anterior (aBTN) and posterior (pBTN)) delaminating. Scale bar, 25 um. 

b, Enlarged view of aBTN in lower panel of a. Top, UNC-76::eGFP; 

middle, phalloidin; bottom, merged; arrowhead, part of aBTN still in the 
epithelium. Scale bar, 10 um. c, Embryo with paraxial mesoderm labelled 

by Mrf>unc-76::eGFP (green), BTN labelled by Neurog b-line >unc- 
76::mCherry (magenta) and phalloidin counterstain. Scale bar, 25 um. 
Right, cross-sectioned 3D image of same embryo. Only the right side of the 
embryo was transfected. d, 3D slice of embryo showing BTNs (arrowheads) 
outside neural tube (dotted outline). Only the right side of the embryo was 
transfected. e, Diagram of d showing BTNs relative to other tail tissues: 
neural tube (NT), notochord (NC), myoblasts (M) and endoderm (EN). 

f, Time series of different embryos co-electroporated with Neurog 

b-line >unc-76:: Venus YFP (green) and Nodal b8.18>H2B::mCherry 
(magenta). Right panels are enlarged views of the images on the left. Dashed 
lines indicate displacement from clonally related epidermal cells (indicated 
by colour-coded brackets). Scale bars, 25 um. 


We detected the earliest expression of Neurog at neurulation, in the 
caudal-most neural/epidermal boundary cells, which express the con- 
served neural plate border specification genes Msx'*, Pax3/7 (ref. 3 ) 
and Snail'® (Fig. 1c-f and Extended Data Fig. 1). During neurulation, 
these cells drive neural tube closure and their progeny eventually form 
the neural tube roof plate and dorsal epidermis midline’”'® (Fig. 1b and 
Extended Data Fig. 2). BI'N progenitors are thus born from the caudal 
extensions of the lateral borders of the neural plate (Fig. 1f). 

We isolated a Neurog cis-regulatory element that drives reporter 
gene expression in this caudal neural plate border region (Extended 
Data Fig. 3). Using this reporter, we determined that Neurog expres- 
sion is progressively restricted and maintained in only two cells on 
each side of the bilaterally symmetric embryo, born during neural tube 
closure (Extended Data Figs 2 and 4). We have named these the ante- 
rior (aBTN) and posterior (pBTN) BTN precursors. Shortly after the 
completion of neural tube closure, BTN precursors delaminate and 
migrate anteriorly along the paraxial mesoderm on either side of the 
neural tube’? (Fig. 2a-f and Supplementary Videos 1-3). This is evoc- 
ative of vertebrate DRGN progenitors, which migrate through paraxial 
mesoderm situated lateral to the neural tube. 

Double-labelling with a Nodal reporter revealed that BI'Ns arise from 
two adjacent but clonally distinct cell lineages (Fig. 2g and Extended 
372 
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DMSO at 5.5 h.p.f 


U0126 at 5.5 h.p.f U0126 at 7.5 h.p.f 


Neurog 


DMSO at 5.5 h.p.f |e U0126 at 7.5 h.p.f 
Neurog 


+H2B 


Neurog Asic 


Neurog 


Control 


Neurog Phalloidin 


4 
Protocadherin.c 
Control 


Figure 3 | Bipolar tail neuron specification and differentiation. a, Wild- 
type Neurog b-line >unc-76:: Venus YFP expression (green) in embryos treated 
with DMSO vehicle, counterstained with DAPI (blue). b, Neurog expression 
was abolished in 43 of 50 embryos treated with 10 uM MEK inhibitor U0126 
at 5.5 hours post-fertilization (h.p.f.). c, Supernumerary BTNs were specified 
in 28 of 50 embryos treated with 10 uM U0126 at 7h.p.f. d, Two BIN 
precursors, labelled by Neurog b-line>unc-76:: Venus YFP (green) and Neurog 
b-line>H2B::mCherry (magenta), migrating on one side of a DMSO-treated 
embryo. e, Expanded chain of four BTNs resulting from treatment with 
U0126 at 7h.p.f. f, BTN expressing Asic >unc-76::eGFP reporter in embryo 
electroporated with Neurog b-line >nls::lacZ as a control. g, Overexpression of 
Neurog induces specification of ectopic Asict BTNs in 53 of 100 embryos. 

h, Overexpression of a dominant repressor form of Neurog (Neurog::WRPW) 
abolishes BTNs in 97 of 100 embryos. i, j, In situ hybridization reveals 
expression of Cadherin.b (green) in the neural tube but not migrating 

BTN precursors (i) and expression of Protocadherin.c (green) in dorsal 
epidermis midline but not BTNs (j). Embryos in i, j electroporated with 
Neurog b-line >unc-76::mCherry (immunolabelling of mCherry in magenta). 
k, Forced overexpression of protocadherin.c in the BTN lineage using the 
Neurog b-line driver inhibits delamination and migration of BTNs in 7 of 14 
embryos. 1, Normal BTNs as seen in 9 of 12 control embryos (overexpression 
of B-galactosidase instead). Embryos in k, ] electroporated with Neurog 
b-line >unc-76:: Venus YFP and Neurog b-line >H2B:: Venus YFP (green) and 
counterstained with phalloidin (magenta). All scale bars 25 um. Embryos in 
a-e, i, j fixed at stage 22. Embryos in f-h, k, lat stage 23. 


Neurog overexpression 


Neurog::WRPW 


Data Fig. 2). The pBTN arises from the tail tip (b8.21 lineage)'® 
and migrates to meet the b8.18-derived aBTN as it delaminates (Fig. 2a, f). 
Together, they continue their migration as a chain of two cells. 
Neurog expression distinguishes the BT'Ns from the caudal epider- 
mal sensory neurons (CESNs), which remain at the dorsal midline 
and are specified instead by an atonal homologue (Atoh)-dependent 
regulatory program!*”°. We found that the onset of Neurog expression 
requires MAPK/ERK signalling (Fig. 3a, b). However, later inhibition 
of MAPK/ERK resulted in the upregulation of Neurog in non-neural 
cells of the lineage, converting these into supernumerary BTNs 
(Fig. 3c-e and Extended Data Fig. 4). In contrast, perturbing Delta/ 
Notch signalling did not alter BTN specification or differentiation 
(Extended Data Fig. 5). Overexpression of Neurog also induced 
ectopic migratory Asict BTN precursors (Fig. 3f, g), while BTNs were 
abolished through expression of a dominant repressor form of Neurog 
(Neurog::WRPW, Fig. 3h). In all cases, induced supernumerary BIN 
precursors migrated as an expanded chain of cells (Fig. 3e, g). These 
data indicate that sustained Neurog expression in caudal neural plate 
border cells is controlled by MAPK/ERK signalling and is necessary 
and sufficient for BTN specification, migration and differentiation. 
In vertebrates, neural crest EMT is effected in part through differen- 
tial cell adhesion, mediated by various mechanisms regulating cadherin 
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CESN 


Distal process ¢ 


Figure 4 | Synaptic connections of bipolar tail neurons. a, BTN 
labelled by Gad >unc-76::eGFP (green) contacting a CESN labelled by 
Slc17a6/7/8(Vglut) >unc-76::mCherry (blue). b, Proximal process of BTN 
labelled by Islet BTN >unc-76::mCherry (green) contacting motor neurons 
(MNs) labelled by Fgf8/17/18> unc-76::eGFP (magenta). c, Diagram of 
Ciona larva showing synaptic connections between CESNs in tail epidermis, 
BTNs and MNs. d, Two synaptic inputs (red boxes, insets) from the sheet- 
like profile of a CESN to a left-side BTN; transmission electron micrograph 
from wide-area montage. The profile of a second BTN axon lies out of view. 
Axon profiles from two right-side BTN axons (rs. BTNs) are visible. BTNs 
overlie the neural tube (NT) with neural canal (marked with an asterisk) 

and cross-sectioned cilia (arrow). An epidermal cell (Ep) overlies the BTN. 
Each synapse enlarged in inset has ~52 nm diameter presynaptic vesicles 
(white arrowhead), and the left synapse has a postsynaptic density (red 
arrowhead). Scale bars, 1 um (inset scale bars, 0.5 um). Not., notochord. 

e, Synaptic input (arrow) from a BTN to the axon of a member of the most 
anterior pair (A11.118) of motor neurons (MN1), identified by a cumulus of 
~60-nm presynaptic vesicles and a shallow postsynaptic density (arrowhead). 
A second input nearby originates from the axon of an eminens neuron” 
(arrow). Inset, synaptic input from BTN to the axon of a second pair of motor 
neurons (MN2). Scale bars, 1 um. 


function?!. We found that expression of Cadherin.b, the predominant 
cadherin gene expressed in the neural tube of Ciona embryos, is absent 
in BTN precursors (Fig. 3i). Moreover, BTN precursors do not express 
Protocadherin.c, a cadherin superfamily gene expressed in CESNs and 
epidermis midline (Fig. 3j). Overexpression of protocadherin.c protein 
inhibited delamination and migration of BTNs (Fig. 3k, 1), suggesting 
that Ciona BTNs and vertebrate neural crest share regulatory strategies 
for EMT via differential cell-cell adhesion. 

We observed that each BTN precursor initially migrates anteriorly 
with a prominent leading edge that becomes the cell’s anterior neurite 
(or ‘proximal process’), while its Golgi apparatus is located posterior to 
the cell nucleus. At around 12h post-fertilization, each BTN precursor 
undergoes a 180° polarity inversion, with the Golgi repositioning itself 
anterior to the nucleus immediately before the cell begins to elaborate 
the posterior segment of its neurite (the ‘distal process’), resulting in 
a bipolar morphology (Extended Data Fig. 6, Supplementary Video 4 
and Supplementary Table 1). These observations suggest that a precisely 
timed re-orientation of cell polarity underlies the characteristic bipolar 
morphology of the BINs. 

At hatching, BTN cell bodies are situated in the middle of the tail 
along the anterior—posterior axis, with their distal processes extending 
towards the tail tip and proximal processes projecting towards the 
motor ganglion and brain (Fig. 4a-c)'?. Electron microscopy con- 
firmed that the BTN somata lie outside the neural tube and are invar- 
iably overlain by epidermal cells (Fig. 4d). BT'Ns lack junctions with 
epidermal cells and also lack cilia, thus failing to penetrate the tunic 
to contact the exterior. These characteristics suggest that while distal 
BTN neurites may be sensory, their cell bodies lack epidermal sensory 
receptors found in CESNs~. Along the tail, the BT'Ns contact overlying 
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CESNs, the short processes of which do not reach the motor ganglion” 
(Fig. 4a—c). At these contacts, synapses form from the CESN to the 
BTNs (Fig. 4d). Unlike the CESNs, the proximal processes of the BTNs 
form synaptic contacts with the motor neurons that innervate and 
control the tail muscles (Fig. 4b, c, e). Each BTN establishes many such 
contacts upon the two most anterior pairs of motor neurons, MN1 
and MN2, on both the left and right sides (Fig. 4e and Extended Data 
Table 1). These synaptic connections are similar to those of mamma- 
lian slowly adapting type I DRGNSs that, in addition to being mech- 
anosensitive themselves, relay distinct inputs from mechanosensory 
Merkel cells of the epidermis”’. Both tunicate CESNs and vertebrate 
Merkel cells arise from non-migratory epidermal cells, require Atoh 
factors for their specification and are glutamatergic in their neuro- 
transmitter phenotype!®”°*+">, These data suggest that tunicate BTNs 
may thus be equivalent to vertebrate DRGNs within a homologous 
ascending sensory pathway (Fig. 4c). 

In anamniote vertebrates, evidence for a common progenitor of 
intramedullary Rohon-Beard neurons (RBNs) and neural crest, in 
addition to other similarities between RBNs and DRGNs, indicates 
a deep homology between these cell types”°. Fritzsch and Northcutt 
proposed that a key step in the evolution of neural crest was the elabora- 
tion of extramedullary sensory neurons from intramedullary RBN-like 
neurons”’. Following the Fritzsch-Northcutt model, the BTNs may 
be derived from an ‘intermediate’ extramedullary neuron that evolved 
in the last common ancestor of Olfactores (vertebrates and tunicates) 
before the appearance of bona fide neural crest in the vertebrates. The 
migration of BTN precursors along the paraxial mesoderm, similar to 
later phases of DRGN migration, suggests that some of the diverse EMT 
and migratory behaviours displayed by vertebrate neural crest cells may 
pre-date the emergence of vertebrates. 

Although the embryological origin (neural plate borders) and 
molecular signature (Neurog’/Islet*) of the BINs of Ciona also support 
homology with RBNs, the two do in fact differ in several key aspects. 
First, BTNs are extramedullary neurons derived from progenitor 
cells that migrate along paraxial mesoderm lateral to the neural tube. 
Second, expression of ASICs is shared between BT'Ns and DRGNs, 
but appears absent from RBNs”*. Finally, RBNs are multipolar with 
extensively branching peripheral neurites that innervate the overlying 
epidermis”’, while we have not observed any peripheral neurites pro- 
jecting from the bipolar/pseudounipolar BTNs. 

We have revealed the developmental history of migratory neu- 
ronal progenitors that arise from the neural plate borders of tunicate 
embryos. Based on their embryological origin, gene expression, cell 
behaviour, morphology and synaptic connections, we propose that the 
BTNs are homologous to neural-crest-derived DRGNs. This would 
imply that the neural plate borders of the olfactorean ancestor gave 
rise to at least two types of neural crest derivatives: pigment cells and 
peripheral neurons (Extended Data Fig. 7). 

In the invariantly developing Ciona embryo, the pigment cell and 
BTN lineages become separated early in development, but converge 
at a neural plate border cell identity before parting again towards dis- 
tinct differentiated fates. This separation between the two lineages may 
represent the ancestral condition of the neural plate borders before the 
evolution of the neural crest in vertebrates. This would support models 
that propose an evolutionary origin for vertebrate neural crest through 
a heterochronic shift or ‘intercalation of a multipotent progenitor state 
downstream of neural plate border specification but upstream of cell 
differentiation, based on shared regulatory programs between neural 
crest and pluripotent cells of the early embryo’”°. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Molecular cloning. Reporter constructs were designed based on information 
of cis-regulatory modules (CRMs) from previously published studies on the 
following genes: Islet*!, Msx*?, Neurog*’, Nodal*4, Asic\, glutamate decarboxy- 
lase (Gad)*®, Slc17a6/7/8 (Vglut)*> and Fef8/17/18 (ref. 36). The Neurog b-line 
CRM (Ciinte.REG.KhC6.1500090-1502346) was cloned using the following 
primers: Neurog —3,010 forward (5‘-GTCTGTTTCCGCATACATGC-3’) and 
Neurog —773 reverse (5’-CTTATACGCCGAACCTCATG-3’). The Neurog 
b-line minimal CRM (Ciinte.REG.KhC6.1500090-1500501) was found to be 
contained within this region and cloned using Neurog —3,010 forward and 
Neurog —2,599 reverse (5'-GCAAAACGTTTCCCGATTCG-3’) primers. 
Neurog CRMs were cloned upstream of the basal promoter of Neurog (Ciinte. 
REG.KhC6.1502506-1503107), cloned using the primers Neurog —594 forward 
(5'-GGTCATGCTTTGTTACGTCC-3’) and Neurog +9 reverse (5’-ATCCAACAT 
TTTGTAGCAAGAGC-3’), or the basal promoter of the Zfpm gene (also known as 
friend of GATA, or Fog)*”. The full-length Mrf CRM (Ciinte.REG.KhC14.4311719- 
4314636) was cloned using the primers (5’-GCAAGCTCCTTTGGGGTTTGG-3’) 
and (5/-CGTATAAATATGTCAAACTACCGGC-3’). Caenorhabditis elegans UNC-76 
tags were fused to fluorescent proteins to ensure even labelling of axons**. Probes 
used for in situ hybridization were transcribed in vitro from templates obtained 
from previously published gene collection clones*”” for Neurog (RICiGC29n04), 
Pax3/7 (R1CiGC42e20), Ebf (R1CiGC02i14) and Cadherin.b (VES104_F13) or 
cloned de novo from coding sequences for Snail (KH.C3.751.v1.C.SL1-1) and 
Protocadherin.c (KH.C9.32.v1.A.SL1-1). Golgi-targeting sequence was cloned 
from KH.C14.396.v1.B.ND1-1 cDNA (N-acetylgalactosaminyltransferase 7, or 
Galnt7) using the primers Galnt7 amino acid 1 forward (5’-ATGAGATTTAAAA 
TCGCATCAGTTTTG-3’) and Galnt7 amino acid 157 reverse (5’/-AAGTGATAT 
CTTGTCGCTGTTCAC-3’) and fused in-frame to fluorescent proteins. Neurog 
coding sequence and Neurog:: WRPW have been previously cloned and published*!. 
dnFGER has been previously published”, as has Su(H)-DBM*’. 

Embryo handling, in situ hybridization and immunolabelling. For purposes 
other than for electron microscopy (see below), eggs and embryos from wild- 
caught Ciona intestinalis (species type A, ‘robusta’) purchased from M-REP 
(San Diego, California) were handled according to established protocols“. Double 
in situ hybridization/immunolabelling was performed as described in previous 
publications*“°, Monoclonal anti-B-galactosidase (Promega catalogue number 
Z3781), rabbit polyclonal anti-mCherry (BioVision, accession number ACY24904), 
and Alexa Fluor-conjugated secondary antibodies (Life Technologies) were all used 
at 1:500 working dilution. Alexa Fluor-conjugated phalloidin (Life Technologies) was 
used at 1:50 working dilution. MEK inhibitor U0126 (Cell Signaling Technology) 
was resuspended as stock solution in DMSO at 10mM concentration, and diluted 
to 10uM in artificial sea water for embryo treatments. Sample sizes equal the total 
number of embryos present per microscope slide, unless these exceeded arbitrarily 
set limits of 50 or 100 embryos. No statistical methods were used to predetermine 
sample size and no replicates were used. The experiments were not randomized and 
the investigators were not blinded to allocation during experiments and outcome 
assessment. 

Fluorescence/confocal microscopy and photoconversion. Images were captured 
ona Leica inverted TCS SP8 X confocal or DM2500 epifluorescence microscope. 
For time-lapse image capture, embryos were imaged as they developed in sea water- 
filled chambers on coverslip-bottom Petri dishes (MatTek). Confocal image stacks 
were processed in Leica Application Suite or ImageJ. Video annotations were made 
using Camtasia software (TechSmith). 3D slices and projections were generated 
using Imaris (Bitplane) or Volocity (PerkinElmer) software. Kaede::nls*” was 
photoconverted as previously described**. Neurite lengths and Golgi apparatus 
positioning were measured using ImageJ. Not all cells, neurites and/or Golgi were 
visible in every embryo. Golgi positioning relative to BTN nuclei was measured 
in degrees of angle formed between a line traced anteriorly from the nucleus and 
another line traced through the middle of the Golgi complex. Thus, when the Golgi 
complex is perfectly aligned anterior to the nucleus, the angle is 0°, whereas if the 
Golgi complex is perfectly posterior to the nucleus, the angle is 180°. Rose plots 
(angle histograms) were generated in Matlab (http://www.mathworks.com/help/ 
matlab/ref/rose.html). 
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Electron microscopy. Adult animals, Ciona intestinalis (L.), were collected by 
P. Darnell from Mahone Bay, Nova Scotia. Two-hour larvae reared at 18°C in 
the dark were fixed at 4°C for 1h in 1% OsOy, in 1.25% NaHCO; adjusted to 
pH 7.2 with HCl, followed by 2% glutaraldehyde in 0.1 M phosphate buffer. After 
fixation they were embedded in Epon, and a single larva cross sectioned at 60nm 
in the motor ganglion and later at 100 nm down the length of the tail, and the 
sections post-stained for 5-6 min in freshly prepared aqueous urany] acetate fol- 
lowed by 2-3 min in lead citrate. Sections were viewed using an FEI Tecnai 12 
electron microscope operated at 80kV and images captured using either a Kodak 
Megaview II camera using software (AnalySIS: SIS GmbH), or a Gatan 832 Orius 
SC1000 CCD camera using Gatan DigitalMicrograph software to compile multi- 
panel montages from each section. Comprehensive electron micrograph series 
identified the cell bodies and axons of BTNs, motor neurons and CESNs from their 
positions and shapes, and these in turn enabled identification of their connections 
(K.R. and I.A.M., manuscript in preparation). 


31. Stolfi, A. et a/. Early chordate origins of the vertebrate second heart field. 
Science 329, 565-568 (2010). 

32. Russo, M. T. et al. Regulatory elements controlling Ci-msxb tissue-specific 
expression during Ciona intestinalis embryonic development. Dev. Biol. 267, 
517-528 (2004). 

33. Stolfi, A. & Christiaen, L. Genetic and genomic toolbox of the chordate Ciona 
intestinalis. Genetics 192, 55-66 (2012). 

34. Khoueiry, P. et al. A cis-regulatory signature in ascidians and flies, independent 
of transcription factor binding sites. Curr. Biol. 20, 792-802 (2010). 

35. Takamura, K., Minamida, N. & Okabe, S. Neural map of the larval central 
nervous system in the ascidian Ciona intestinalis. Zoolog. Sci. 27, 191-203 
(2010). 

36. Imai, K. S., Stolfi, A., Levine, M. & Satou, Y. Gene regulatory networks underlying 
the compartmentalization of the Ciona central nervous system. Development 
136, 285-293 (2009). 

37. Rothbacher, U., Bertrand, V., Lamy, C. & Lemaire, P. A combinatorial code 
of maternal GATA, Ets and B-catenin-TCF transcription factors specifies 
and patterns the early ascidian ectoderm. Development 134, 4023-4032 
(2007). 

38. Dynes, J. L. & Ngai, J. Pathfinding of olfactory neuron axons to stereotyped 
glomerular targets revealed by dynamic imaging in living zebrafish embryos. 
Neuron 20, 1081-1091 (1998). 

39. Satou, Y. et al. AcDNA resource from the basal chordate Ciona intestinalis. 
Genesis 33, 153-154 (2002). 

40. Roure, A. et al. A multicassette Gateway vector set for high throughput and 
comparative analyses in Ciona and vertebrate embryos. PLoS ONE 2, e916 
(2007). 

Al. Stolfi, A. Wagner, E., Taliaferro, J. M., Chou, S. & Levine, M. Neural tube 
patterning by Ephrin, FGF and Notch signaling relays. Development 138, 
5429-5439 (2011). 

42. Davidson, B., Shi, W., Beh, J., Christiaen, L. & Levine, M. FGF signaling 
delineates the cardiac progenitor field in the simple chordate, Ciona intestinalis. 
Genes Dev. 20, 2728-2738 (2006). 

43. Hudson, C. & Yasuo, H. A signalling relay involving Nodal and Delta ligands acts 
during secondary notochord induction in Ciona embryos. Development 133, 
2855-2864 (2006). 

44. Christiaen, L., Wagner, E., Shi, W. & Levine, M. The sea squirt Ciona intestinalis. 
Cold Spring Harb. Protoc. pdb.emo138 (2009). 

45. Beh, J., Shi, W., Levine, M., Davidson, B. & Christiaen, L. FoxF is essential for 
FGF-induced migration of heart progenitor cells in the ascidian Ciona 
intestinalis. Development 134, 3297-3305 (2007). 

46. Ikuta, T. & Saiga, H. Dynamic change in the expression of developmental genes 
in the ascidian central nervous system: revisit to the tripartite model and the 
origin of the midbrain-hindbrain boundary region. Dev. Biol. 312, 631-643 
(2007). 

47. Ando, R., Hama, H., Yamamoto-Hino, M., Mizuno, H. & Miyawaki, A. An optical 
marker based on the UV-induced green-to-red photoconversion of a 
fluorescent protein. Proc. Natl Acad. Sci. USA 99, 12651-12656 (2002). 

48. Razy-Krajka, F. et al. Collier/OLF/EBF-dependent transcriptional dynamics 
control pharyngeal muscle specification from primed cardiopharyngeal 
progenitors. Dev. Cell 29, 263-276 (2014). 

49. Nishida, H. Cell division pattern during gastrulation of the ascidian, Halocynthia 
roretzi. Dev. Growth Differ. 28, 191-201 (1986). 

50. Bone, Q. The central nervous system in amphioxus. J. Comp. Neurol. 115, 
27-64 (1960). 


© 2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


Extended Data Figure 1 | In situ hybridization of neural plate border b, Double in situ hybridization for Snail (green on merged image) and Msx 
markers Snail and Msx. a, Immunolabelling for B-galactosidase (red) and (red on merged image) in stage 12 embryos counterstained with DAPI 

in situ hybridization for Snail mRNA (green) in stage 12 embryo (blue on merged image), showing co-expression in neural plate border cells, 
electroporated with Msx>lacZ, revealing Snail expression in the BTN including BTN progenitors. Scale bars, 25 um. 


progenitors (b9.36 cells, arrowheads). Dashed area enlarged in a’. 
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Extended Data Figure 2 | Lineage tracing of b9.36 descendants. 

a, Photoconversion of Kaede::nls driven by the Msx driver was used to follow 
the cell divisions of the BTN progenitors from the late gastrula stage to the 
early tailbud stage. Both b10.71 and b10.72 divide once. b11.141 will give rise 
to a definitive anterior BTN (see Extended Data Fig. 4). Numbers in each 
panel represent time in minutes elapsed from the initial photoconversion 
event. Scale bar, 50 um. b, Lineage tree showing specification of aBTNs in 
relation to other cells of the posterior neural plate borders. For simplicity, 


only one side of the embryo is depicted. c, Lateral view of a 110-cell-stage 
embryo showing the positions of blastomeres in b. Red lines connect 

sibling cells. d, Dorsal view of a neurula-stage embryo showing zippering 

of posterior neural-plate-border-derived capstone cells'® as neural tube 
closure is initiated. Panels b and d are courtesy of H. Hashimoto and FE. Robin 
(University of Chicago) and N. Takatori (Tokyo Metropolitan University), 
and partially modelled after ref. 17. Panel c modelled after ref. 49. 
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Extended Data Figure 3 | Neurog cis-regulatory sequences. a, Schematic b-line (red) reporter constructs. Reporter co-expression is seen in b9.36 
diagram representing Neurog locus and 5’ cis-regulatory sequences including descendants on either side of the neural plate. Neurog expression also marks 
b-line and b-line minimal cis-regulatory modules. Peaks represent nucleotide tail-tip lineages of uncertain provenance, previously reported to be descended 
sequence conservation with Ciona savignyi genome. b, Late gastrula from b8.21 (ref. 10). Scale bar, 25 um. ¢, Neurog b-line reporter. d, Neurog 
embryo (stage 13) electroporated with full-length Neurog (blue) and Nodal b-line minimal reporter. Scale bars in ¢, d, 50m. 
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Extended Data Figure 4 | Spatiotemporal restriction of Neurog expression. 
a, Lateral view of in situ hybridization (ISH) for Neurog (green) in embryo 
electroporated with Neurog b-line >H2B::mCherry (red) shows that Neurog 
expression is selectively maintained in only a subset of initially Neurog- 
expressing neural plate border cells. a’, In the b9.36 lineage, the anterior-most 
cell (b11.141, solid arrowhead) is always the sole one to express Neurog at 

this stage, and will go on to become the anterior BIN. Dashed arrowhead 
indicates b11.142, the sister cell of b11.141, which has downregulated 

Neurog relative to its sibling. b, b’, The identities of the cells in the tail tip 
(presumed b8.21-derived) lineages are unclear, but Neurog is similarly 
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restricted (arrowheads) to a single cell on either side of the midline, which 
we interpret as the definitive posterior BIT'Ns. c, Control embryo treated 
with DMSO vehicle, showing wild-type pattern of Neurog expression only 
in b11.141. d, Neurog is expanded to b11.142 upon treatment with the MEK 
inhibitor U0126 at 7h.p.f. This condition also results in specification of 
supernumerary BTNs, presumably due to expanded Neurog expression (see 
text for details). Thus, downregulation of Neurog in b11.142 also requires 
MEK/ERK signalling. e, Diagram of the aBTN lineage, descended from the 
b8.18 blastomere. Scale bars in a, b, 25 um. Scale bars in c, d, 10 um. 
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Extended Data Figure 5 | Perturbation of Notch signalling does not 

alter Neurogenin expression or bipolar tail neuron specification and 
differentiation. a, Top, lateral view of a stage 23 embryo electroporated 

with Msx >H2B::mCherry (magenta nuclei), Neurog b-line >unc-76::eGFP 
(green) and Msx >nls::lacZ, serving as the wild-type control condition. 
Bottom, embryo electroporated with same reporters as upper panel, plus 
Msx >Su(H)-DBM, which encodes a DNA-binding mutant form of the Notch 


b 
(RCs TR TS rg Ge 


Control 


Neurog b-line>Su(H)-DBM 


co-activator Rbpj. No discernable difference in Neurog activation or BIN 
specification was observed between control and Su(H)-DBM conditions 

(1 of 32 versus 2 of 42 embryos showing ectopic Neurog* BTNs, respectively). 
b, Late overexpression of Su(H)-DBM using the Neurog b-line driver similarly 
did not alter BTN specification/differentiation, as monitored by Asic >unc- 
76::eGFP reporter expression (0 of 50 control versus 0 of 50 Su(H)-DBM 
embryos showed ectopic Asic* BTNs). Scale bars, 50 um. 
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Extended Data Figure 6 | Cell polarity and morphogenesis of bipolar 
tail neurons. a, Embryo at 11.5h.p.f. (18°C) with BTNs displaced from 
clonally related epidermal cells (epid.) labelled by UNC-76::VenusY FP 
(red), Galnt7AC::CFP (green), and H2B::mCherry (blue) driven by Neurog 
b-line cis-regulatory module. Targeted localization of CFP by the Galnt7 
N-terminal signal sequence reveals polarized subcellular distribution of 
Golgi apparatus on posterior side of BTN nuclei as migration and proximal 
process extend in an anterior direction. This is distinct from the apical 
(dorsal) location of the Golgi apparatus in epidermal cells. b, Embryo at 
12.5h.p.f. (18°C) showing 180° inversion of Golgi apparatus localization 

to the anterior side of the nucleus, immediately preceding distal process 
extension. Scale bars in a, b, 50 um. ¢, Still frames from a confocal image 
stack time lapse movie (Supplementary Video 4) showing inversion of Golgi 


rs | 


———— 


average process lengths 


ant. | 


46 um, n=24 


Golgi orientation 


complex (Galnt7AC::VenusYFP, green) relative to nuclei (H2B::mCherry, 
red) in migrating BI'Ns. Time lapse imaging initiated at 11.5h.p.f. (18°C). 
Time in minutes elapsed from start shown at bottom right of each panel. 
Anterior BTN (aBTN) indicated by magenta arrowhead, posterior BTN 
(pBTN) indicated by white arrowhead. Scale bar, 25 um. d, Diagram showing 
correlation of average length of proximal (left) and distal (right) processes 
and angle of Golgi apparatus location relative to cell nucleus along the 
anterior—posterior axis in BTNs at different time points. Locations of Golgi 
apparatus represented by rose plots of bins of 20° spanning anterior (0°) and 
posterior (180°) endpoints around dorsal edge of BTN nucleus. Bin diameters 
indicate number of cells. Embryos analysed belong to the same pool as 
embryos in a and b. See Supplementary Table 1 for source data. 
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Extended Data Figure 7 | Proposed evolution of neural crest through 

the acquisition of multipotency by neural plate border cells. a, Cartoon 
diagram depicting a hypothetical path for neural plate border and neural crest 
evolution, starting with the reconstructed last common olfactorean ancestor, 
which could have had neural plate borders lined with committed progenitor 
cells giving rise to several pigmented ocelli and BTN-like peripheral neurons, 
a condition that may be conserved in extant cephalochordates*’. These cells 
would have been reduced in the highly miniaturized embryos of extant 
tunicates, while vertebrates are proposed to have co-opted a mesenchymal, 
multipotency program to bestow these cells with the potential to give rise 

to pigment cells, peripheral neurons or other derivatives, after a prolonged 
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period of EMT and migration. b, Diagram representing idealized cell lineages 
in the neural plate borders of tunicate and hypothetical urolfactorean 
ancestor, in which segregated lineages at the neural plate borders give 

rise to committed pigment cell or peripheral neuronal progenitors. 

c, Diagram of simplified neural crest cell lineage deploying a multipotency 
program downstream of neural plate border specification and upstream 

of cell differentiation. Thus, neural crest cells could have evolved through 
redeployment of a multipotency program (intercalation hypothesis)', or 
through its maintenance from earlier embryonic stages (heterochrony 
hypothesis)”. 
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Extended Data Table 1 | Synaptic input from bipolar tail neurons to motor neurons, identified by electron microscopy 


Postsynaptic motor 
neuron identity 


MN1 Left (A11.118) 


MN1 Right (A11.118) 


MN2 Left (A10.57) 


MN2 Right (A10.57) 


MN3 Left 


MN4 Left 


MN4 Right 


MN5 Left 


MN5 Right 


Synapse 
partnership 
BTN1-->MN1L 
BTN3-->MN1L 

Total 
BTN1-->MN1R 
BTN2-->MN1R 
BTN3-->MN1R 
BTN4-->MN1R 
Total 
BTN1-->MN2L 
BTN3-->MN2L 
Total 
BTN2-->MN2R 
BTN4-->MN2R 
Total 
BTN1-->MN3L 
Total 
BTN1-->MN4L 
Total 
BTN2-->MN4R 
BTN4-->MN4R 
Total 
BTN1-->MN5L 
Total 
BTN4-->MN5R 
Total 


Number of 
synapses 
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Total number of sections 
with synaptic profile 


134 
88 


163 
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BTN, bipolar tail neuron. MN, motor neuron. Axons of BTN1 and BTN3 lie on the left hand side of the embryo, and BTN2 and BTN4 on the right. The axons are not traced to their somata to indicate which would 


be anterior and posterior. 
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Decapentaplegic and growth control in the 
developing Drosophila wing 


Takuya Akiyama! & Matthew C. Gibson! 


As a central model for morphogen action during animal 
development, the bone morphogenetic protein 2/4 (BMP2/4)- 
like ligand Decapentaplegic (Dpp) is proposed to form a long- 
range signalling gradient that directs both growth and pattern 
formation during Drosophila wing disc development!~°. 
While the patterning role of Dpp secreted from a stripe of cells 
along the anterior-posterior compartmental boundary is well 
established!*®, the mechanism by which a Dpp gradient directs 
uniform cell proliferation remains controversial and poorly 
understood’-!’. Here, to determine the precise spatiotemporal 
requirements for Dpp during wing disc development, we use 
CRISPR-Cas9-mediated genome editing to generate a flippase 
recognition target (FRT)-dependent conditional null allele. By 
genetically removing Dpp from its endogenous stripe domain, we 
confirm the requirement of Dpp for the activation of a downstream 
phospho-Mothers against dpp (p-Mad) gradient and the regulation 
of the patterning targets spalt (sal), optomotor blind (omb; also 
known as bifid) and brinker (brk). Surprisingly, however, third-instar 
wing blade primordia devoid of compartmental dpp expression 
maintain relatively normal rates of cell proliferation and exhibit only 
mild defects in growth. These results indicate that during the latter 
half of larval development, the Dpp morphogen gradient emanating 
from the anterior—posterior compartment boundary is not directly 
required for wing disc growth. 

Morphogens, signalling molecules secreted from a localized source to 
form gradients of activity, are proposed to coordinately control growth 
and patterning in diverse organismal systems?**!*~!7, In Drosophila 
melanogaster, the BMP2/4-like ligand Dpp is highly expressed in a row 
of cells at the anterior—posterior (A/P) compartment border in third- 
instar wing discs (72-120h after egg laying (AEL); Fig. la). The 
secreted ligand is proposed to emanate from this position to form a 
long-range gradient that directs uniform growth and concentration- 
dependent patterning’®. Although the requirements for Dpp in disc 
patterning are widely accepted!*, precisely how a gradient of Dpp 
might direct homogenous cell proliferation is controversial’"!°. The 
general requirements for Dpp in growth are clear; imaginal discs fail 
to develop in dpp mutant larvae’ and ectopic expression of Dpp is 
sufficient to trigger overgrowth in lateral regions of the wing disc’. 
Nevertheless, owing to a lack of methods for the detection and disrup- 
tion of Dpp, the mechanism by which its downstream activity gradient 
directs uniform cell proliferation remains unknown. 

To visualize Dpp directly, we first generated a polyclonal antibody 
(anti-Dpp) that recognizes the Dpp prodomain on western blots and 
labels the expected compartmental stripe domain in fixed tissues 
(Fig. 1b, cand Extended Data Fig. 1). Next, to validate the specificity 
of anti-Dpp, we induced mitotic cell clones homozygous for the hypo- 
morphic allele dpp””? (ref. 20). As expected, dpp“’” mutant clones that 
impinged on the stripe domain correlated with a pronounced reduc- 
tion of Dpp and p-Mad levels, indicating that both Dpp expression 
and the activity gradient downstream of Dpp signalling were abolished 


(Dpp n=49, p-Mad n = 32; Extended Data Fig. 2). Surprisingly, how- 
ever, loss of dpp at the compartmental boundary had minimal effects 
on growth and cell proliferation in third-instar discs (n = 80; Extended 
Data Fig. 2). We extended these results by generating larger dpp*”” 
mutant clones using the Minute technique (M(2)25A; Fig. 1d-k)?)””. 
Consistent with our conventional clonal analysis, extensive loss of the 
Dpp stripe (n = 24) and its associated p-Mad activity gradient (n =20) 
did not strongly affect proliferation in adjacent cell populations (n = 44; 
Fig. 1d—k). Indeed, only when clones were induced very early in devel- 
opment did we observe severely reduced wing discs. 

The results described earlier indicate that the compartmental stripe 
of Dpp expression is not essential for growth in third-instar wing 
discs. However, dpp“!? mutant clone analysis entails some limitations, 
including the hypomorphic nature of the lesion and the lack of pre- 
cise spatiotemporal control over clone induction. Thus, to eliminate 
Dpp expression more precisely from the entire stripe domain, we used 
CRISPR-Cas9-mediated homologous recombination to generate a con- 
ditional null allele that harbours FRT sequences flanking the first dpp 
coding exon (dpp'?°"* (dpp*°); Fig. 2a, Extended Data Fig. 3a and 


120 h AEL 


a 72nhAEL 


Figure 1 | dpp”? mutant clones have little effect on wing disc growth. 


a, Wing discs grow dramatically during the third larval instar (72-120h 
AEL). dpp-expressing cells are visualized with dpp-GAL4 > UAS-GFP 
(red). Scale bar, 100m. b, Anti-Dpp recognizes the Dpp precursor 

(Pre) and prodomain (Pro) in western blots. An arrowhead indicates a 
previously unidentified Dpp product. Asterisks indicate non-specific 
bands. a-Tubulin (Tub): loading control. c, Dpp expression in wild-type 
wing disc. Scale bar, 100 um. d-k, dpp“!? mutant clones. Control (d, e, h, i) 
and dpp“? mutant (f, g, j, k) clones were stained with anti-Dpp (d-g) 

and anti-p-Mad (h-k). Phospho-histone 3 (p-H3; white) labels mitotic 
cells (d, f). Scale bars, 50 tm. Anterior is to the left in all figures. 
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show varying degrees of size reduction after 
heat shock at the time points indicated. k-1, Dpp 
stripe expression is maintained in controls 

(k) but eliminated from the wing blade region 
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(1). Scale bars, 100 um. m, Size comparison 
between wing discs of the genotypes indicated. 
Mean + standard deviation (s.d.). *P < 0.001, 
not significant (NS), two-sided Student's t-test. 
n-o, Mitotic cells in dpp’°/+; dpp-GAL4/UAS- 
FLP controls (n) and dpp*°/dpp"°; dpp-GAL4/ 
UAS-FLP (0) wing discs labelled with anti-p-H3 
(green) and anti-Dpp (red). Scale bar, 50 um. 

p, Mitotic index in dpp"°/+; dpp-GAL4/UAS- 
FLP controls (n =57) and dpp*°/dpp"°; dpp- 
GAL4/UAS-FLP (n= 54) wing discs. Mean + s.d. 
*P < 0.001, not significant (NS), two-sided 
Student's t-test. 
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Methods). The dpp’° allele is homozygous viable, and thus dpp-null 
mutant cells can be generated in precise spatial and temporal patterns 
by using controlled expression of the FLP recombinase to direct exci- 
sion of the genomic region between the FRT sites (Fig. 2b and Extended 
Data Figs 3b, 4a—d). Similar to previously characterized dpp mutants, 
hs-FLP; dpp*°/dpp*® larvae exhibited severely reduced wing discs and 
loss of anti-Dpp staining when subjected to heat-shock-induced disrup- 
tion of dpp during early larval development at 48h AEL (Fig. 2c—h, m). 
We observed a similar loss of Dpp but less pronounced growth defects 
after Dpp removal at later time points (72 and 96h AEL), indicating 
that there is a continuous requirement for dpp expression during wing 
disc development (Fig. 2i-j, m). 

To test specifically the requirements for the Dpp morphogen gradient 
in disc growth, we used a disc-specific dpp-GAL4 to drive expression 
of FLP in the compartmental stripe domain of dpp*°/dpp*° wing discs 
(Fig. 2b). Under these conditions Dpp protein was eliminated from the 
A/P compartmental boundary throughout the wing blade primordium 
(n=40; Fig. 2k, land Extended Data Fig. 4e-k). Strikingly, however, the 
affected discs were grossly normal in both size and overall morphology 
(Fig. 2k-m). Some residual Dpp expression was detected in the poste- 
rior hinge, part of the ventral-anterior hinge and in some peripodial 
cells in which dpp-GAL4 is not expressed (Fig. 21, arrowheads, and 
Extended Data Fig. 4i-k). In addition, small clusters of cells express- 
ing Dpp were frequently observed in proximity to the dorso-ventral 
boundary, perhaps due to reduced Gal4 expression (n = 27/40; Fig. 21, o 
and Extended Data Fig. 4i-k). However, consistent with the results of 
dpp”"? mosaic analyses (Fig. 1d-k and Extended Data Fig. 2), disruption 
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of the Dpp stripe caused only a mild growth defect relative to controls 
(7% reduction of area; Fig. 2k-m), without any obvious effect on cell 
proliferation (Fig. 2n—p). While we cannot rule out a contribution of 
residual Dpp to the growth of dpp*°/dpp"°; dpp-GAL4/UAS-FLP wing 
discs, these experiments suggest that there is no instantaneous growth 
requirement for the canonical Dpp gradient centred on the A/P com- 
partment boundary of the wing blade primordium. 

To address the kinetics of dpp disruption in dpp*°/dpp"°; dpp-GAL4/ 
UAS-FLP discs, we monitored Dpp expression and p-Mad activity at 
72, 96 and 120h AEL (Fig. 3). Compared with controls (Fig. 3a, c), the 
compartmental stripe of Dpp and its associated p-Mad activity gradi- 
ent were disrupted in dpp*°/dpp'°; dpp-GAL4/UAS-FLP wing discs 
dissected and fixed at 72h AEL (Fig. 3b, d). The loss of Dpp and its 
activity gradient became more pronounced by 96h AEL, and persisted 
as discs grew to a relatively normal size by 120h AEL. Combined, these 
observations are consistent with the results of dpp“!? clonal analysis 
(Fig. 1d-k and Extended Data Fig. 2) and further support the conclu- 
sion that a continuous Dpp gradient centred on the compartmental 
stripe is not essential for proliferative growth in the third larval instar. 

During wing disc development, p-Mad levels peak at the compart- 
mental boundary and gradually diminish laterally (Fig. 4a)!?. Upon 
phosphorylation, Mad translocates to the nucleus to regulate the tran- 
scriptional targets Sal, Omb and Brk (Fig. 4b-d), which in turn define 
the positions of the vein primordia visualized by anti-Delta (Dl) stain- 
ing (Fig. 4e)*°. In agreement with this general model for wing pattern- 
ing downstream of Dpp, removal of the compartmental Dpp stripe 
led to a pronounced loss of the p-Mad gradient (n = 37; Fig. 4f) and 
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Figure 3 | The Dpp gradient is not essential for growth in third instar 
wing discs. a, b, Stripe Dpp expression is present in dpp*?/+; dpp-GAL4/ 
UAS-FLP controls (a) but eliminated from dpp'°/dpp?°; dpp-GAL4/ 
UAS-FLP (b) wing discs during the third larval instar (72-120h AEL). 


the corresponding Sal (n = 25; Fig. 4g) and Omb expression domains 
(n=53; Fig. 4h). In addition, Brk expression was de-repressed through- 
out the wing pouch (n = 25; Fig. 4i) and patterning of presumptive 
vein territories was disrupted in discs devoid of the Dpp stripe (n= 20; 
Fig. 4j). Intriguingly, in the course of these experiments we also noted 
that dpp disruption in dorsal cells led to a compartment-specific loss 
of p-Mad that was not rescued by Dpp of ventral origin (Extended 
Data Fig. 5). Likewise, dpp“!? mutant clones adjacent to the dorsal- 
ventral border resulted in a compartment-specific loss of p-Mad stain- 
ing (Fig. 1j, k and Extended Data Fig. 2g, h), suggesting either that Dpp 
protein is not able to cross the dorso-ventral boundary or that Dpp 
movement is directionally regulated along the disc A/P axis. 

Taken together, our results confirm that dpp is a crucial regulator 
of wing disc pattern formation (Fig. 4a-j) but also demonstrate that 
the canonical morphogen gradient is not continuously required for 
growth and cell proliferation as the disc doubles in size during the third 
larval instar (Figs 2k—p, 3 and Extended Data Fig. 4f-k). This implies 
that the requirement for dpp in disc growth could either be fulfilled by 
earlier functions of the pathway or by a cellular source of Dpp outside 
the classical stripe domain. An important possibility in dpp*°/dpp"°; 
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c, d, The p-Mad activity gradient observed in dpp'°/+; dpp-GAL4/UAS- 
FLP controls (c) is abolished in dpp’°/dpp"°; dpp-GAL4/UAS-FLP wing 
discs during the third larval instar (d). Scale bars, 100 um. 


dpp-GAL4/UAS-FLP discs is that an initial burst of Dpp expression at 
the compartment boundary could precede FLP-mediated excision of 
the dpp locus and provide a sufficient stimulus to initiate early disc 
growth. Nevertheless, we found that global inactivation of dpp gener- 
ated growth defects even at relatively late time points, consistent with 
a continuous requirement (Fig. 2h-j). To probe potential sources of 
Dpp expression outside of the stripe domain, we eliminated Dpp from 
defined spatial territories using the Gal4/UAS system. While loss of 
Dpp expression from whole discs or anterior cells alone elicited severe 
growth phenotypes, elimination of Dpp from posterior cells showed 
little or no effect (Extended Data Fig. 6a—d)*°. Phenotypes caused by 
Dpp elimination with both ap- and nub-GAL4 were consistent with 
the interpretation that Dpp produced by anterior cells is necessary for 
wing disc growth (Extended Data Fig. 6e, f), even though the com- 
partmental stripe itself was not essential (Fig. 21, 0). To assess the 
temporal requirements for dpp in anterior cells, we used GAL80* to 
repress ci-GAL4 activity (Fig. 4k-v and Extended Data Fig. 7)”°. In wing 
discs from dpp*°/dpp*°, ci-GAL4, UAS-GFP; UAS-FLP, tub-GAL80"/+ 
larvae, Dpp expression was generally disrupted within 24h of removing 
Gal80-mediated repression of Gal4 (n = 34/40; Fig. 40, p). Inactivation 


Dl Figure 4 | The Dpp stripe is crucial for wing 
Doe pattern formation. a—j, The p-Mad activity 
gradient and BMP-dependent gene expression 
domains observed in controls (a-e) are severely 
disrupted following loss of the Dpp gradient in 
dpp"°/dpp"°; dpp-GAL4/UAS-FLP wing discs 
(f-j). Scale bar, 50 um. k-v, Dpp elimination 
from the anterior compartment of wing discs 
at a series of time points before dissection. In all 
cases, dpp"°/dpp"®, ci-GAL4, UAS-GFP; UAS-FLP, 
tub-GAL80"/-+ wing discs were stained for 
anti-Dpp (red) and DNA (blue). Green 
fluorescent protein (GFP; green) indicates 
de-repressed Gal4 activity. Compared with 
controls (k, 1), normal Dpp expression is 
maintained for 18h after temperature shift 
(m, n; n =33) but mostly eliminated within 24h 
(0, p; 2 =34/40). Normal Dpp staining is lost 
after temperature shifts at 36 (q, r; n = 33), 48 
(s, t; n= 80) and 72 (u, v; 1 =43)h before 
dissection. Scale bar, 100 um. 
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of anterior dpp at earlier larval time points through temperature shifts 
caused the most severe growth defects (Fig. 4s—v), but even late inacti- 
vation resulted in mildly reduced disc size (Fig. 4q, r). Although these 
experiments offer limited temporal resolution, the results indicate that 
Dpp produced within the ci-GAL4 expression domain is required for 
wing disc growth throughout development. By inference, this would 
indicate that Dpp expressed within the anterior compartment is suffi- 
cient to sustain homogenous disc growth in the absence of the canon- 
ical morphogen gradient during the third larval instar. 

Drosophila Dpp has long served as a central paradigm for under- 
standing how morphogens regulate growth and patterning during ani- 
mal development. To date, several distinct models have been proposed 
for imaginal disc growth control by the Dpp activity gradient”"'°. We 
directly tested the requirements for a gradient of Dpp signalling dur- 
ing wing disc development. Consistent with established dpp mutant 
phenotypes and the critical temporal requirement for Dpp during early 
development”’, eliminating Dpp throughout the disc at early larval 
stages caused severe growth defects (Fig. 2h). Surprisingly, however, 
abrogation of the compartment-boundary-centred Dpp signalling 
gradient did not disrupt active cell proliferation and elicited only mild 
growth defects during the third larval instar (Figs 2k—p, 3 and Extended 
Data Fig. 4f-k). In summary, while Dpp is clearly required for disc 
growth, we propose that the classical Dpp morphogen gradient pri- 
marily regulates pattern formation and is not continuously required 
to drive proliferative growth in the latter half of larval development. 
These findings suggest dynamic spatial and temporal requirements 
for Dpp. Expanding on our results, we speculate that other morphogen 
systems may utilize a similar strategy to coordinate growth and pattern- 
ing during organ and appendage development. 
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METHODS 

Dpp antibody production. dpp complementary DNA was cloned into 
pET-DEST42 (Invitrogen) using the GATEWAY system. The expression of Dpp- 
His6 was induced by the addition of 1 mM isopropylthiogalactoside (IPTG) at an 
ODe00 nm Of 0.5 in 1 | of LB media at 37 °C. After a 3h induction, cells were harvested, 
resuspended in 50 ml of denaturing buffer (50 mM Tris-HCl, 1 M NaCl, 20 mM 
imidazole, 6 M urea, and 0.1% Triton X-100, pH 7.5) and disrupted by sonication. 
The cell lysate was applied to a Ni Sepharose column for protein purification (GE 
Health Care Life Sciences). The column was washed (50 mM Tris-HCl, 1 M NaCl, 
6M urea, and 0.1% Triton X-100, pH 7.5) and DPP-His6 proteins were eluted with 
2 ml of elution buffer (50 mM Tris-HCl, 1M NaCl, 500 mM imidazole, 6M urea, 
and 0.1% Triton X-100, pH 7.5). The purified protein was renatured by step-wise 
reduction of urea concentration. The soluble fraction of the purified Dpp—His6 
protein was used to immunize two rabbits pre-screened for low serum immu- 
noreactivity against imaginal discs, one of which produced suitable antibodies. 
Rabbit anti-sera against Dpp—His6 were affinity purified using the Dpp prodomain 
(amino acids 1-456). 

Clonal analyses of dpp“’* mutant cells in wing discs. All flies were maintained 
using standard cornmeal media at 25 °C. dpp“” is a strong hypomorphic allele that 
carries a chromosomal inversion in the dpp“” enhancer region”°. Mutant clones 
were generated by the FLP-FRT method”’ and marked by the absence of GFP 
according to the following schemes. Control cross: y,w,hs-FLP; ubi-GFP.FRT40A/ 
CyO x w; FRT40A; experimental cross: y,w,hs-FLP; ubi-GFPR.FRT40A/CyO x 
w; dpp"!, FRT40A/CyO,twi-GAL4, UAS-GFP. 

dpp*”? clones in a Minute background?!” were generated as follows. Control 
cross: y,w,hs-FLP; ubi-GFP.M(2)25A,FRT40A/CyO x w; FRT40A; experimental 
cross: y,w,hs-FLP; ubi-GFP.M(2)25A,FRT40A/CyO x w; dpp"!?, FRT40A/CyO, 
twi-GAL4, UAS-GFP. Clones were induced at 37°C for 2h at 48h AEL (standard) 
or 72h AEL (Minute). 

Western blot analysis. Wing discs were homogenized in SDS sample buffer 
and boiled. Protein samples were subjected to 4-20% Mini-PROTEAN TGX gel 
(Bio-Rad) or 10% SDS-PAGE gel electrophoresis and then analysed by west- 
ern blot with SuperSignal (Thermo). Rabbit anti-Dpp (1:1,000), mouse 
anti-a-tubulin (1:1,000, Sigma) and horseradish peroxidase (HRP)-conjugated 
secondary antibodies (1:10,000, Jackson ImmunoResearch) were used for 
detection. 

Generation of an FLP/FRT-mediated conditional dpp-null allele. pBFv-U6.2 
(ref. 28) was used for making ssRNA DNA constructs. The following primers were 
annealed and cloned into the BbsI site of pBFv-U6.2. sgRNA construct 1: 5’‘-CTT 
CGGTTCGGATGTGGACCGGAA-3’, 5’-AAACTTCCGGTCCACATCCGAA 
CC-3’; sgRNA construct 2: 5’-CTTCGGACAGAAGGATCTAGGGAT-3’, 5/-AA 
ACATCCCTAGATCCTTCTGTCC-3’. 

To generate a ubi-GFP selection cassette, the 1,760 bp promoter region of 
ubiqutin-63E was amplified from w'!!8 genomic DNA using 5‘-GGCGGCGAA 
TTCATCAGTACTGTCCAAAATCGAAAATCGCCGAACCG-3’ and 5’-GGC 
GGCGGTACCTTTGGATTATTCTGCGGGAAGAAAATAGAGATGTGG-3' 
primers with EcoRI and KpnI sites at the 5’ and 3’ ends, respectively (restriction 
enzyme sites in primers are in bold). Then, the PCR product was cloned into the 
EcoRI and KpnI sites of pH-Stinger”’. 

The Gibson assembly technique (NEB) was employed to obtain a donor DNA 
for CRISPR-Cas9-mediated homologous recombination. First, five PCR fragments 
were prepared using the following primer sets. Fragment 1 (pHSG298 vector): 
5'-CCGGGTACCGAGCTCGAA-3’, 5’-GGATCCTCTAGAGTCGACCTG-3’; 
fragment 2 (left arm homology-FRT 5’): 5/-AGGTCGACTCTAGAGGATCCCG 
AAAGATCCCTTTGCGC-3’,5’-TTATGATATCGAAGTTCCTATACTTTCTA 
GAGAATAGGAACTTCGGAATAGGAACTTCGAATGGAATCGCGTTCGT 
ATTCCACTCAATCC-3’; fragment 3 (loxP 5’-ubi-GFP selection cassette-loxP 
3’): 5’-TAGGAACTTCGATATCATAACTTCGTATAGCATACATTATACGAA 
GTTATTCGCCAAGCTTGGGCTGCATCACGTAATAAGTGTGCGTTG-3’, 
5’-ATGTGGACCGATAACTTCGTATAATGTATGCTATACGAAGTTATTTA 
ACTTACATACATACTAGAATTGATCGGCTAAATGGTATGGC-3’; fragment 
4 (dpp-FRT 3’): 5‘-ACGAAGTTATCGGTCCACATCCGAACCC-3’, 5’-AGGAT 
CTAGGGAAGTTCCTATACTTTCTAGAGAATAGGAACTTCGGAATAGGAA 
CTTCCATATGGATCGGCAGGTATGCAAATCGCTTAG-3’; fragment 5 (right 
arm homology): 5'-TAGGAACTTCCCTAGATCCTTCTGTCCTCG-3’, 5’-ATT 
CGAGCTCGGTACCCGGGCGGGAATGCTCTTCACGTC-3’. 

After the PCR reaction, all fragments were purified using Zymoclean Gel DNA 
Recovery kit (Zymo research) and subjected to Gibson assembly (NEB). 

After confirming the DNA sequences, these plasmid DNAs were mixed, 
precipitated by ethanol precipitation and dissolved in nuclease-free water with 
food dye at 250ng ul! for each DNA construct. The DNA mixture was injected 
into the posterior side of embryos obtained from a cross of w!!78 and y,cho,v; 
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attP40{nos-Cas9}/CyO (ref. 28). Transgenic flies were first selected by GFP expres- 
sion and were further confirmed by Southern blots, PCR and DNA sequencing. 
Genomic Southern blot analysis. Genomic DNAs from w!!/8 and w; dpp'O-CF?/ 
CyO adults were prepared as described previously*°, digested with Clal at 37 °C 
for 4h, and subjected to 0.7% agarose gel electrophoresis. After electrophoresis, 
Southern blotting was performed using a standard protocol described previously*". 
A DIG-labelled GFP probe was generated using DIG DNA labelling kit (Roche). 
After hybridization at 65 °C overnight, hybridized DNA fragments were visualized 
via alkaline phosphatase conjugated anti-Dig (1:5,000, Roche). 

PCR analysis of FLP/FRT-mediated dpp knockdown. Fifty wing discs 
from w; dpp'°/dpp*° and w; dpp’°/dpp'°; dpp-GAL4/UAS-FLP were col- 
lected to obtain genomic DNAs using Maxwell 16 system (Promega). Then, 
PCR was carried out using 5’-CCACCGATCCGCCTTATCGGAGG-3’ and 
5!-CGCCGCCTTCAGCTTCTCGTCG-3’ primers. 

Heat shock. Control cross: y,w,hs-FLP; dpp'°/CyO,sChEP x w!"!®; experimental 
cross: y,w,hs-FLP; dpp’°/CyO,sChEP x y,w,hs-FLP; dpp'°/CyO,sChEP. dpp mutant 
cells were induced by heat shock at 37 °C for 30 min at 48, 72 or 96h AEL (ref. 32). 
GAL4/UAS. dpp-GAL4 (dpp**-GAL4; GAL4 expression is controlled by a partial 
dpp** enhancer)**, en-GAL4 (ref. 34), ci-GAL4 (ref. 35), ap-GAL4 (ref. 36) and 
nub-GAL4 (ref. 36) were used to induce FLP to eliminate Dpp expression from 
specific regions of wing discs using the Gal4/UAS system (ref. 37). The efficiency 
of FLP/FRT-mediated recombination was monitored by using Act5c(-FRT)lacZ 
(ref. 38). 

dpp-GAL4. Control cross: w; dpp*°/CyO,sChFP; UAS-FLP/TM6C x w; dpp-GAL4/ 
TMO6B; experimental cross: w; dpp’°/CyO,sChFP; UAS-FLP/TM6C x w; dpp'°/ 
CyO,sChEP; dpp-GAL4/TM6C. 

dpp-GAL4, Act5c(-FRT)lacZ. Control cross: w; dpp’°/CyO,sChFP; UAS- 
FLP.Act5c(-FRT)lacZ/TM6C x dpp-GAL4/TM6B; experimental cross: w; dpp'°/ 
CyO,sChFP; UAS-FLP.Act5c(-FRT)lacZ/TM6C x w; dpp*°/CyO,sChEFP; dpp-GAL4/ 
TM6C. 

en-GAL4,ci-GALA4. Control cross: w; dpp'°/CyO,sChFP; UAS-FLP/TM6C x 
w; en-GAL4,ci-GAL4 (on II); experimental cross: w; dpp’°/CyO,sChFP; UAS-FLP/ 
TM6C x w; dpp*°,en-GAL4,ci-GAL4/CyO,twi-GAL4, UAS-GEP. 

en-GAL4. Control cross: w; dpp*°/CyO,sChFP; UAS-FLP/TM6C x w; en-GAL4 
(on II); experimental cross: w; dpp'°/CyO,sChFP; UAS-FLP/TM6C x w; dpp°, 
en-GAL4/CyO,twi-GAL4, UAS-GFP. 

ci-GALA4. Control cross: w; dpp’°/CyO,sChFP; UAS-FLP/TM6C x w; ci-GAL4 
(on II); experimental cross: w; dpp’°/CyO,sChFP; UAS-FLP/TM6C x w; dpp*®,- 
ci-GAL4/CyO,twi-GAL4, UAS-GFP. 

ci-GAL4, tub-GAL80* . Control cross: w; dpp*°/CyO,sChFP; UAS-FLP, 
tub-GAL80"/TM6C x ci-GAL4, UAS-GFP (on II); experimental cross: w; dpp'°/ 
CyO,sChEP; UAS-FLB tub-GAL80°/TM6C x w; dpp'°,ci-GAL4,UAS-GEP (on II). 
ap-GAL4. Control cross: w; dpp*°/CyO,sChFP; UAS-FLP/TM6C x w; ap-GAL4/ 
T2;3; experimental cross: w; dpp’°/CyO,sChFP; UAS-FLP/TM6C x w; dpp*°, 
ap-GAL4/CyO,twi-GAL4, UAS-GFP. 

nub-GAL4. Control cross: w; dpp*°/CyO,sChEP; UAS-FLP/TM6C x w; nub- 
GAL4 (on II); experimental cross: w; dpp°/CyO,sChFP; UAS-FLP/TM6C x w; 
dpp*°, nub-GAL4/CyO,twi-GAL4, UAS-GEP. 

Immunohistochemistry and imaging. Immunostaining of wing discs was carried 
out as previously described*? with some modifications. Rabbit anti-Dpp (1:100), 
mouse anti-p-H3 (1:1,000, Millipore), mouse anti-Dlg (1:500, DSHB), mouse 
anti-B -galactosidase (Z3781, 1:200, Promega), rabbit anti-pSmad3 (EP823Y, 
1:1,000, Epitomics)*°“", rat anti-Sal (1:200)”, rabbit anti-Omb (1:1,000)*8, guinea 
pig anti-Brk (1:500)*4, mouse anti-D] (C594.9B, 1:500, DSHB)**, and Alexa- 
conjugated secondary antibodies (1:500, Invitrogen) were used for this study. 
All primary antibodies were diluted in Can Get Signal Immunostain Solution 
B (TOYOBO). DNA was visualized using Hoechst 33342 (1:1,000; Thermo 
Scientific). Images of wing discs were obtained using a Leica TCS SP5 confocal 
microscope. 

Image analysis. Sizes of wild-type wing discs at 72h (n= 61), 96h (n=57) and 
120h AEL (n= 84) were measured using the ‘measure’ function of FIJI and ana- 
lysed using Student's t-test. The same method was used to measure discs of the 
following genotypes: dpp™?/dpp"? (n= 25), hs-FLP; dpp*°/dpp’° (heat shocked 
at 48h (n = 23), 72h (n= 22) and 96h AEL (n= 26)), dpp'°/+; dpp-GAL4/ 
UAS-FLP (n= 187) and dpp*°/dpp"°; dpp-GAL4/UAS-FLP (n= 252). Mitotic 
index was determined by dividing the numbers of mitotic cells (p-H3-positive 
cells) by total cell numbers (visualized by anti-Dlg staining) in a 30.03 um square 
in the anterior—ventral wing pouch region. The ‘find maxima function of FIJI 
was used to automatically count total cell numbers. No statistical methods were 
used to predetermine sample size. The experiments were not randomized. The 
investigators were not blinded to allocation during experiments and outcome 
assessment. 
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Extended Data Figure 1 | Endogenous Dpp expression in imaginal Note that dpp-GAL4 is not expressed in the morphogenetic furrow of the 
discs. a-f, Wing (a, b), eye-antenna (c, d), and leg (e, f) imaginal discs third instar eye-antenna disc (arrow in d). Dotted lines show outlines of 
from UAS-GFP/+; dpp-GAL4/-+ larvae are dissected and stained with imaginal discs. Blue: DNA. Scale bars, 100 um. Anterior is left. 


anti-Dpp antibody. GFP (green) indicates dpp-GAL4-expressing cells. 
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Extended Data Figure 2 | Dpp and p-Mad expression in dpp””” clones. a—h, Control (a, b, e, f) and dpp“’? mutant (c, d, g, h) clones are stained with 
anti-Dpp (a-d) and anti-p-Mad (e-h) antibodies. Clones are marked by the absence of GFP (green). Disc boundaries are indicated by dotted lines. Scale 
bars, 50 um. Anterior is to the left. 
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Extended Data Figure 3 | FLP/FRT-mediated conditional dpp-null Dpp start codon and almost half of its coding sequence (the first 288/588 
allele. a, A flowchart describing the establishment of an FLP/FRT- amino acids), FLP/FRT mediated recombination is predicted to yield a 
mediated conditional dpp-null allele. Grey and white boxes indicate null allele. b, dpp’? “"? heterozygous, dpp"° heterozygous, and dpp’° 
untranslated region (UTR) and dpp coding sequences, respectively. FRT homozygous adult flies. Importantly, dpp’° homozygous animals have 


sequences flank the first coding exon (exon 5). Since this exon contains the — normal adult morphology. 
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Extended Data Figure 4 | Validation of an FLP/FRT-mediated 
conditional allele. a, FRT 5’-loxP and FRT 3’ genomic regions are 
sequenced. b, Southern blot analysis of dpp’?°"”. Genomic DNAs 

from w!"/8 and dpp*®-C? are digested by Clal and are subjected to 
Southern blot analysis using a GFP probe. c, Molecular confirmation of 
the FLP/FRT-mediated dpp FLP-OFF system. As expected, an FLP-OFF 
product (2,349 bp PCR fragment) is only detected in the dpp*°/dpp"°; 
dpp-GAL4/UAS-FLP lane. Asterisk indicates a non-specific PCR product. 
d, Biochemical evidence of the FLP/FRT-mediated dpp FLP-OFF system. 
y,w,hs-FLP; dpp*°/dpp"® larvae are incubated at 37 °C for 30 min at 96h 


AEL to eliminate Dpp expression. After 24 h, wing discs are homogenized 
in SDS sample buffer and analysed by western blot analysis with anti-Dpp. 
Non-specific bands are indicated by an asterisk. Anti-a-tubulin is used 

as a loading control. e, A system to visualize the efficiency of FLP/FRT- 
mediated recombination. f-k, dpp"°/+; dpp-GAL4/UAS-FLP, Act5c(-FRT) 
lacZ controls (f, g, h) and dpp*°/dpp"°; dpp-GAL4/UAS-FLP,Act5c(-FRT) 
lacZ (i, j, k) wing discs are stained with anti-Dpp and B-galactosidase 
antibodies. The lineage of dpp-GAL4-expressing cells is visualized by 
anti-B-galactosidase staining. Scale bar, 100 1m. Anterior is left. 
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Extended Data Figure 5 | p-Mad staining of wing discs lacking dpp (c, d) are dissected and stained with anti-p-Mad antibody. The dorso- 
function in the dorsal compartment. a-d, dpp*°/ap-GAL4, UAS-GFP; ventral boundaries are indicated by green dotted lines. Yellow dotted lines 
UAS-FLP/+ (a, b) and dpp*°/dpp*°, ap-GAL4, UAS-GFP; UAS-FLP/+ show the disc areas. Scale bar, 50 um. 
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Extended Data Figure 6 | Elimination of Dpp from specific regions of UAS-FLP/+ (e), and dpp*°, nub-GAL4/dpp*°; UAS-FLP/+ (f). Gal4- 

wing discs. a-f, Anti-Dpp antibody staining of wild-type (a), dpp”®, expressing domains are highlighted in grey in each illustration. Wing disc 
ci-GAL4,en-GAL4/dpp"; UAS-FLP/+ (b), dpp"°,en-GAL4/dpp"°; boundaries are shown by dotted lines. Scale bar, 100 tm. Anterior is left. 
UAS-FLP/+ (c), dpp’°,ci-GAL4/dpp"°; UAS-FLP/+ (d), dpp"°,ap-GAL4/dpp"®; 
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Extended Data Figure 7 | Spatiotemporal Dpp removal from the stained with anti-Dpp. Gal4 activity is monitored by GFP expression. 
anterior region of wing discs. a, A strategy for temporal Dpp elimination Scale bar, 100 xm. g, Size comparison between wing discs: dpp*°/ci-GAL4, 
from the anterior compartment of wing discs using the GAL80* system. UAS-GFP; UAS-FLP tub-GAL80"/+ (0 (n=21) and 72h (n= 40) before 
At 18 °C, Gal4 activity is blocked by Gal80. When flies are kept at 29 °C dissection) and dpp*°/dpp*°, ci-GAL4, UAS-GFP; UAS-FLP, tub-GAL80"/+ 
(non-permissive temperature for GAL80"), Gal4 induces expression of (0 (n= 36), 18 (n= 33), 24 (n= 40), 36 (n =33), 48 (n= 80) and 72h 
FLP and GFP. b, Timing of temperature shift. Larvae are reared at 18°C, (n=43) before dissection). Mean + s.d. *P < 0.001, not significant (NS), 
and are transferred to 29 °C at the indicated time points before dissection. two-sided Student's t-test. 


c=f, dpp'°/ci-GAL4, UAS-GFP; UAS-FLP, tub-GAL80‘/+ controls are 
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Diversion of aspartate in ASS1-deficient tumours 
fosters de novo pyrimidine synthesis 


Shiran Rabinovich!*, Lital Adler'*, Keren Yizhak*, Alona Sarver', Alon Silberman!, Shani Agron!, Noa Stettner!, Qin Sun?, 
Alexander Brandis*, Daniel Helbling’, Stanley Korman’, Shalev Itzkovitz’, David Dimmock’, Igor Ulitsky', 


Sandesh C. S. Nagamani**, Eytan Ruppin??"” & Ayelet Erez! 


Cancer cells hijack and remodel existing metabolic pathways 
for their benefit. Argininosuccinate synthase (ASS1) is a urea 
cycle enzyme that is essential in the conversion of nitrogen from 
ammonia and aspartate to urea. A decrease in nitrogen flux through 
ASS1 in the liver causes the urea cycle disorder citrullinaemia'. In 
contrast to the well-studied consequences of loss of ASS1 activity 
on ureagenesis, the purpose of its somatic silencing in multiple 
cancers is largely unknown2. Here we show that decreased activity 
of ASS1 in cancers supports proliferation by facilitating pyrimidine 
synthesis via CAD (carbamoyl-phosphate synthase 2, aspartate 
transcarbamylase, and dihydroorotase complex) activation. Our 
studies were initiated by delineating the consequences of loss of 
ASS1 activity in humans with two types of citrullinaemia. We find 
that in citrullinaemia type I (CTLN I), which is caused by deficiency 
of ASS1, there is increased pyrimidine synthesis and proliferation 
compared with citrullinaemia type II (CTLN II), in which there 
is decreased substrate availability for ASS1 caused by deficiency 
of the aspartate transporter citrin. Building on these results, we 
demonstrate that ASS1 deficiency in cancer increases cytosolic 
aspartate levels, which increases CAD activation by upregulating 
its substrate availability and by increasing its phosphorylation 
by S6K1 through the mammalian target of rapamycin (mTOR) 
pathway. Decreasing CAD activity by blocking citrin, the mTOR 
signalling, or pyrimidine synthesis decreases proliferation and thus 
may serve as a therapeutic strategy in multiple cancers where ASS1 is 
downregulated. Our results demonstrate that ASS1 downregulation 
is a novel mechanism supporting cancerous proliferation, and 
they provide a metabolic link between the urea cycle enzymes and 
pyrimidine synthesis. 

In contrast to the well-delineated biochemical and clinical conse- 
quences of loss-of-function germline mutations in ASS1, which have 
not been reported to include cancer, studies have shown a correlation 
between somatic deficiency of ASS1 in cancer and poor prognosis, 
for which the mechanism remains obscure*?. Outside the liver, ASS1 
is expressed in most tissues where it catalyses the penultimate step in 
the synthesis of arginine. Argininosuccinate lyase (ASL), the enzyme 
downstream of ASS1, is directly responsible for arginine synthesis* 
(Fig. 1a). A well-established sequel of ASS1 and/or ASL deficiency is 
arginine auxotrophy”; thus, arginine-catabolizing enzymes have been 
used as therapy in ASS1-depleted tumours with limited benefit, espe- 
cially in melanoma, wherein the cancer cells develop resistance by 
re-expressing ASS1 within days’. Since there are cancers in which both 
these genes are epigenetically silenced®, ASS1 deficiency in cancers 
might have an arginine-independent effect, which may be related to 
its substrate, aspartate (Fig. la). 


In the cytosol, aspartate serves as a substrate both for ASS1 and for 
the enzymatic complex CAD. We thus hypothesized that decreased 
ASS] activity might enhance aspartate availability for CAD for the syn- 
thesis of pyrimidine nucleotides to promote proliferation (Fig. 1a). If 
correct, deficiency in the mitochondrial aspartate transporter, citrin, 
would be expected to decrease aspartate availability both for ASS1 and 
for CAD and hence restrict proliferation (Fig. 1a). 

We first assessed the correlation between ASS1 levels and prolifera- 
tion in non-cancerous states. A generic stoichiometric model of human 
metabolism’ predicted that inactivation of ASS1 is significantly asso- 
ciated with an increase in growth rate, and is additionally predicted to 
increase flux through the reaction catalysed by CAD (Fig. 1b). Thus, 
we expected subjects with ASS1 deficiency (CTLN I) to have increased 
synthesis of pyrimidines owing to increased utilization of aspartate by 
CAD, compared with those with CTLN II in whom aspartate availabil- 
ity to CAD is decreased (Fig. la). Indeed, urinary levels of orotic acid, 
a product reflecting the synthetic activity of CAD, were significantly 
elevated in human subjects with CTLN I compared with the norma- 
tive values from a control population and with subjects with CTLN II 
(Fig. la, c). Moreover, we found that CTLN I fibroblasts have increased 
synthesis of pyrimidines and proliferation compared with CTLN II cells 
(Fig. 1d, e). Using ['"Ns]a-glutamine we further showed that CTLN I 
cells generate more total as well as labelled M + 1 aspartate and M+ 1 
uracil, compared with control and CTLN II fibroblasts (Fig. 1f, g and 
Extended Data Fig. 1a-c). Hence, there is a specific decrease in aspar- 
tate transport from the mitochondria in CTLN II, leading to reduced 
aspartate availability for pyrimidine synthesis and restricting prolifera- 
tion. Interestingly, growth restriction has been reported in humans with 
CTLN II® but no growth aberrancies have been reported in CTLN I, 
further providing a clinical human context to the findings and suggest- 
ing that, in physiological proliferation, aspartate deficiency has more 
severe clinical consequences than its enrichment. To corroborate our 
results in another model system, we analysed Ass] messenger RNA 
(mRNA) levels in wild-type newborn mouse intestines, which express 
high levels of Ass1 and contain proliferating and differentiated cells in 
the crypts and villi, respectively’. We found a significant correlation 
between the levels of Ass1 and Glut2, a mature enterocyte marker in 
the differentiated enterocytes in the villi, whereas a significant inverse 
correlation was observed between Ass1 and Ki67, a marker of prolifer- 
ation, in the proliferating cells in the crypts (Fig. 1h). Thus, ASS1 inac- 
tivation has an important role in proliferation of non-cancerous cells, 
by increasing aspartate availability for pyrimidine synthesis by CAD. 

We next evaluated whether this mechanism could be the reason 
for the downregulation of ASS1 in cancer. According to the well- 
established “Warburg effect; different metabolites are diverted from 
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Figure 1 | ASS1 inactivation correlates with non-cancerous 
proliferation. a, Illustration of the metabolic flux involved in nitrogen 
contributions to nucleic acid synthesis. The aspartate nitrogen can be 
used for synthesis of pyrimidines (green path) or urea (blue path). In 
ASS1 deficiency (CTLN I) there is a potential diversion of the aspartate 
towards pyrimidine whereas in citrin deficiency (CTLN II) aspartate is 
not transported across the mitochondria. ASA, argininosuccinate; ARG, 
arginase. b, Prediction by the generic human model; decreasing ASS1 
activation (green line) results in an increase in the cellular growth rate and 
in the flux through the CAD reaction (blue line); a.u., arbitrary units. 

c, Urinary orotic acid levels are elevated significantly in patients 

with CTLN I compared with normative values in control subjects 

(0.3-2.8 mmol mol™! creatinine, depicted by the red dashed line), and 
with those with CTLN II. **P < 0.005 using log-transformed data for 
t-test analysis (n =5 with CTLN I and n = 4 with CTLN II). d-g, These 
experiments were repeated twice with pooled cells from two patients with 
CTLN UH, from one patient with CTLN I and from three control subjects. 
Statistical analysis used one-way analysis of variance (ANOVA). Error bars 
represent standard error. d, Significantly lower levels of pyrimidines in 


their ‘routine pathways’ for the synthesis of biological molecules that are 
essential for cell division and growth. We hence conducted an analysis 
of ASS1 expression data in cancer cell lines from the NCI-60 collection 
and found a significant inverse correlation between ASS1 expression 
levels and the reported doubling time of the cancerous cells (Fig. 2a). 
To further test whether this correlation is explicable by diversion of 
aspartate flux, we used our modelling program and predicted that, with 
ASS1 inactivation, there is an accompanying significant increase in 
aspartate flux through the relevant metabolic reactions for nucleic acid 
synthesis (Extended Data Table 1). In contrast, modelling the inacti- 
vation of ASL predicted an endogenous arginine depletion that does 
not directly affect the flux towards nucleic-acid synthesis (data not 
shown). Furthermore, analysis of The Cancer Genome Atlas database 
for ASL and ASS1 expression shows that these genes can both be down- 
regulated in the same cancers, suggesting that they are not mutually 
exclusive (Extended Data Fig. 1d). Thus, ASS1 silencing in cancerous 
proliferation might have an arginine-independent effect that is related 
to nucleotide synthesis. 

Using specific metabolic models tailored for each of the NCI-60 cell 
lines!°, we further predicted that 8 out of the 13 metabolites compu- 
tationally shown to be increased with ASS1 inactivation were nucleic 
acids (Fig. 2b and Extended Data Fig. le). Additionally, specific analysis 
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CTLN II fibroblasts compared with CTLN I and with normal fibroblasts. 
TMP was undetectable. e, Top: crystal violet staining on day 5 shows 
increased proliferation in CTLN I compared with CTLN I and with 
normal control. Bottom: a quantification graph for the staining. (ANOVA; 
***P <(.0005). f, g, Primary fibroblasts from CTLN I subjects have 
increased levels of aspartate (f) and uracil (g) compared with fibroblasts 
from patients with CTLN II and with normal controls (ANOVA, Tukey’s 
honest significant difference test, *P < 0.0005). This is a representative 
graph of three independent experiments. h, Left: a small-intestine tissue 
section of a mouse was hybridized with FISH probe libraries for single 
Ass1 (green dots) and for Ki67 mRNA molecules (red dots). Nuclei were 
stained with DAPI (4’,6-diamidino-2-phenylindole; blue). Dashed line 
outlines the crypt bottom. White lines mark cell borders according to 
co-immunofluorescence staining with FITC-E-cadherin. A magnifying 
window highlights a clear demarcation of Ass! mRNA localizing to the 
differentiated cells of the crypt (magnification x 100). Goblet cells exhibit 
some non-specific fluorescence appearing in multiple channels. Right: 
quantified correlations between Ass1 and Glut2, an enterocyte marker 
(top), and between Ass1 and Ki67 (bottom). 


of the TCGA database of tumours where ASS1 expression is downregu- 
lated showed a significant upregulation in the expression of CAD, com- 
pared with the paired normal tissue (Fig. 2c). We further confirmed 
the inverse upregulation in the expression of CAD versus ASS1 at the 
mRNA level in the NCI-60 cancer cell-line database as well as in inde- 
pendent databases for patients with osteosarcoma'! and melanoma”; 
we found that downregulation of ASS1 and upregulation of CAD are 
in concordance with cancerous phenotype (Extended Data Fig. 1f, g). 
In addition, we demonstrated the inverse expression levels between 
ASS1 and CAD at the protein level using osteosarcoma and melanoma 
cell lines that differ in their expression pattern of ASS1 (Extended Data 
Figs 1h and 2a). To validate these modelling and global informatics 
analyses with experimental evidence, we studied osteosarcoma cell lines 
in which ASS1 was either deficient (MNNG/HOS) or present (U2O0S) 
(Fig. 2d and Supplementary Fig. 1). Metabolic analyses confirmed that 
cells deficient in ASS1 had an increase in pyrimidine levels, an increase 
in the level of uracil as well as a significantly increased proliferation rate 
(Fig. 2e-g) compared with osteosarcoma cells having higher levels of 
ASS1. We additionally verified these results in melanoma cell lines that 
differed in their level of ASS1 expression (Extended Data Fig. 2b-d). 
To definitively demonstrate the direct correlation between ASS1 
expression and proliferation and to differentiate it from other metabolic 
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Figure 2 | ASS1-deficient tumours have increased proliferation rate and 
pyrimidine levels. a, Top: ASS1 is ranked within the top 24 genes out of 
14,000, that show a significant inverse correlation with proliferation rate 
among the 16 NCI-60 cancer cell lines in which ASS1 is downregulated; 
Spearman’s rank correlation was calculated between the expression of 
each gene and its associated proliferation rates. Bottom: magnified view 
of the correlation between proliferation rate and ASS1 expression levels. 
b, Predicted differences in the production rate of biomass components 
after the inactivation of ASS1. The production of nucleic acids marked 
in arrows, is predicted to have a large increase after ASS1 inactivation in 
most of the NCI-60 cell lines. The figure represents the results obtained 
using the LOX IMVI cell line model; however, the same results were seen 
for all NCI-60 cell lines as well as in using the human generic model. 


changes that occur in cancer cells, we overexpressed ASS1 in MNNG/ 
HOS, and knocked it down in U20S cells (Fig. 3a). Our results clearly 
show that changes in ASS1 levels inversely alter the proliferation rate 
and pyrimidine synthesis in these cells (Fig. 3b—e and Extended Data 
Fig. 3a—f). If the major determinant by which ASS1 overexpression 
decreases proliferation is through diverting aspartate metabolism away 
from pyrimidine synthesis, supplementation with nucleic acids should 
restore proliferation. Indeed, supplementing the media with nucleic 
acids, specifically with pyrimidines, significantly restores the prolifer- 
ation of ASS1-overexpressing cells to a similar level as the parental cell 
line (Fig. 3f and Extended Data Fig. 2e-j). Thus, in two distinct forms 
of cancer, changes in ASS1 expression levels directly affect aspartate 
utilization for pyrimidine synthesis and proliferation. Importantly, 
similar results were obtained in vivo; mice injected with melanoma 
cells knocked down for ASS1 developed tumours that grew more rap- 
idly and had higher levels of total and M + 1-labelled aspartate and 
uracil than the parental tumour cells that expressed the empty vector 
(Fig. 3g, h and Extended Data Fig. 3g). 

An expected synergistic way to increase aspartate delivery for 
pyrimidine synthesis would be by upregulation of citrin. Analysis 
of the TCGA database showed that in tissues that normally do not 
express citrin (also known as SLC25A13) at high levels', there is a 
significantly elevated expression in the cancerous state (Extended Data 
Fig. 4b). In addition, in the liver where citrin is strongly expressed, a 
recent publication of ASS1 expression in hepatocellular carcinoma 
showed that downregulation of ASS1 is associated with a more malig- 
nant cancerous phenotype“*. These results, together with our study of 
primary human fibroblast cells (Fig. 1c—g), imply that proliferation 


The models used are based on a series of simplifying assumptions as 
described by us previously in detail!®. c, Analysis of the TCGA database 
of matched tumour-normal tissue pairs showing that CAD expression is 
elevated significantly in tumours with ASS1 downregulation compared 
with normal tissue (*P < 0.05). d, Immunoblot of osteosarcoma cell lines 
showing decreased expression of ASS1 compared with the loading control 
GAPDH in MNNG/HOS compared with U2OS. e, Osteosarcoma cells 
with ASS1 downregulation have a significant increase in pyrimidine levels 
as measured by LC-MS (n> 3; *P < 0.05; ***P < 0.0005). Error bars are 
standard error. f, Osteosarcoma cells with ASS1 downregulation have a 
significant increase in uracil (n = 3; *P < 0.05). Error bars are standard 
error. g, Osteosarcoma cells with ASS1 downregulation have a significant 
increase in proliferation as measured by MTT assay (** *P < 0.0005). 


induced by loss of ASS1 in tumours might be counteracted by inhib- 
iting citrin. Indeed, si-citrin in U2OS decreases proliferation signif- 
icantly when ASS1 levels are reduced (Fig. 4a). Use of si-citrin also 
decreases pyrimidine levels as well as total and labelled levels of M+ 1 
aspartate and of M+ 1 orotic acid (Fig. 4b, c and Extended Data Fig. 
4c, d). As citrin is part of the malate—aspartate shuttle, its deficiency 
is expected to affect several aspects in cell survival and growth. Our 
results indicate that citrin function in transferring mitochondrial- 
derived aspartate is important for supplying substrate for pyrimidine 
synthesis, especially in cancers with ASS1 downregulation. These find- 
ings are therapeutically relevant as survival analysis of several can- 
cers in the TCGA database reveals that cancers with decreased ASS1 
expression and high citrin levels have a trend for significantly worse 
prognosis (Fig. 4d, Extended Data Fig. 4e and Extended Data Table 2). 

The use of citrin-derived aspartate by CAD requires CAD activa- 
tion. Recently, CAD was shown to be activated by ribosomal protein S6 
kinase (S6K1), regulated by the mTOR pathway’. When ASS1 expres- 
sion in cancer cells is decreased, we find increased phosphorylation of 
S6K1 and CAD that is reduced by si-citrin, implying that aspartate levels 
are important in regulating the mTOR pathway activation (Fig. 4e and 
Extended Data Fig. 4f). In addition, we show a significant increase in 
the location proximity between CAD and citrin after ASS1 downreg- 
ulation (Extended Data Fig. 4g). Thus, aspartate regulates pyrimidine 
levels by regulating CAD’s substrate availability, protein localization 
and activity. Consistent with this, we see a decrease in proliferation 
when ASS 1-deficient cells are treated either with the mTOR inhibitor 
rapamycin or with thymidylate synthase inhibitor 5-fluorouracil (SFU) 
(Fig. 4f). Importantly, rapamycin treatment is accompanied by a 
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Figure 3 | ASS1 expression levels in cancer determine aspartate 
availability for pyrimidine synthesis. a, Immunoblots of osteosarcoma 
cells after transduction with either ASS1 overexpression construct (left) or 
with ASS1-shRNA (right). b, MTT proliferation assay in osteosarcoma cells 
showing a significant decrease in proliferation after ASS1 overexpression 
(***P < 0.0005). c, MTT proliferation assay in osteosarcoma cells 

showing a significant increase in proliferation after transduction with 
shASS1 (*P < 0.05). The proliferation values are shown for day 3, after 
normalizing the data for the reading after cell adherence (n > 3). d, LC-MS 
measurements of pyrimidine levels showing a significant increase after the 
use of ASS1-shRNA in osteosarcoma cells (n > 3). e, Total uracil is decreased 
significantly in osteosarcoma cells with ASS1 overexpression (n > 3; 

**P — 0,005). f, Left: INTP supplementation rescues proliferation after ASS1 
overexpression in osteosarcoma cells. Right: pyrimidines significantly rescue 


proliferation in ASS1-overexpressing MNNG/HOS cells (> 3; *P < 0.05, 
** P< 0).005, ***P < 0.0005). g, h, Ten million MALME-3m melanoma cells 
transduced with either pLKO empty vector or with shASS1 were injected 
subcutaneously to immune-deficient mice. The experiment was repeated 
three times. After euthanasia, the tumours were removed, measured and 
incubated with labelled [/°N]a-glutamine for 6 h. Two weeks after injection, 
the group injected with melanoma cells with shASS1 developed tumours that 
grew more rapidly in size (g, left) and were hence significantly larger when 
removed (g, right). The experiment was repeated three times with similar 
results (**P < 0.005, ***P < 0.0005). h, Tumours with shASS1 had higher 
levels of total aspartate (left) and total uracil (right) than those expressing 
the empty vector. Statistical analysis used repeated-measurements ANOVA 
(n = 15; *P< 0.05; **P< 0.005). All error bars are standard error. 
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Figure 4 | Decreasing CAD activation reduces proliferation in ASS1- 
deficient cancers. a, MTT assay showing that decreasing citrin levels 
significantly decreases proliferation in U2OS osteosarcoma, even after a 
significant proliferation increase is accomplished by ASS1 downregulation 
(n> 3; *P<0.05; **P< 0.005; ***P < 0.0005). b, Decreasing citrin levels 
decreases pyrimidine levels in U2OS cells with ASS1 downregulation (n > 3; 
*P <0.05). ¢«, GC-MS measurements showing that U2OS with shASS1 has 

a significant increase in total aspartate (left), as well as in total orotic acid 
(right), which are reversed when transfected with si-citrin (n> 3; ANOVA, 
Tukey's honest significant difference test, *P < 0.05). Error bars are standard 
error. d, Kaplan-Meier survival analysis for two different cancer types (BLCA, 
bladder cancer; LUAD, lung adenocarcinoma), both showing significantly 
poor survival for cancers with low ASS1 and high citrin expression levels 
(n> 3; **log rank P< 0.005). e, Immunoblot of osteosarcoma cells for the 
mTOR pathway downstream effectors S6K1 and CAD showing increased 
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phosphorylation after shASS1 that is reversed when cells are transfected with 
si-citrin. f, Quantification graph of crystal violet staining of osteosarcoma cells 
transduced with shASS1 after drug treatments (Tukey’s post-hoc; P < 0.0005). 
All treatments were significant compared with no treatment. In addition, 

the results show a significant beneficial effect of decreased proliferation in 
response to treatment with either mTOR or pyrimidine synthesis inhibitors 
(rapamycin or 5FU respectively), with 5FU being more beneficial than 
rapamycin (P< 0.0005). Of note, SFU had a significant additive beneficial 
effect when added as treatment to arginine-deleted medium (Tukey’s post- 
hoc test; P< 0.0005). Cells were grown in normal medium, in medium 
depleted of arginine, in complete medium with either rapamycin or 5FU, and 
in arginine-depleted medium together with either rapamycin or 5FU (n=9; 
** PD <().0005). g, Western blot showing decreased activation of the mTOR 
proteins after rapamycin treatment. h, Schematic presentation for potential 
interventions in pyrimidine synthesis in ASS1-deficient tumours. 
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decrease in CAD phosphorylation (Fig. 4g). Hence, targeting aspar- 
tate transport could be an additional therapeutic option in cancers 
with ASS1 silencing, especially in those that develop resistance to 
arginine-depleting agents (Fig. 4h). 

In summary, our studies demonstrate that ASS1, a urea cycle enzyme, 
facilitates pyrimidine synthesis in cancerous proliferation by activating 
CAD, through regulation of aspartate levels. There are several clinical 
trials in patients with ASS1-deficient hepatocellular carcinoma and 
mesothelioma, which combine arginine-depleting agents with thymi- 
dylate synthase inhibitors such as capecitabine and pemetrexed (http:// 
clinicaltrials.gov, NCT02089633, NCT02029690). We believe our study 
provides the rationale for such therapeutic modalities and hence has 
direct translational relevance. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Unless mentioned otherwise, the experiments were not randomized and the 
investigators were not blinded to allocation during experiments and outcome 
assessment. 

Measurements in human subjects. The fibroblast studies were performed on 
anonymized cells devoid of all identifiers. The data analysis involving urine orotic 
acid levels were performed under a protocol approved by the Institutional Review 
Board of Baylor College of Medicine. Urine samples were prepared by mixing 
200 ul of with isotopic internal standard [!*N2]orotic acid (Cambridge Isotope 
Laboratories). Orotic acid and orotidine were assayed on a Micromass Quattro 
mass spectrometer (Waters). HPLC was performed on a Waters ODS-AQ analyt- 
ical column (150mm x 2.0mm internal diameter 5 pm bead size). Mobile phase 
was isocratic 0.05 M ammonium formate (pH 4.0). The MS-MS system was set 
at a flow rate of 0.2ml min™!. The mass spectrometer was operated in electro- 
spray ionization negative multiple-reaction monitoring mode. Nitrogen was used 
as nebulizer gas at a flow rate of 60-901 h~! and desolvation gas 5001h~!. Other 
optimized mass spectrometer parameters were cone voltage 15 V, capillary 3,250 V 
and collision voltage 10 V. 

Genome-scale metabolic modelling. A metabolic network consisting of m metab- 
olites and n reactions can be represented by a stoichiometric matrix S, where the 
entry Sj represents the stoichiometric coefficient of metabolite i in reaction j’”. 
A constraint-based mode imposes mass balance, directionality and flux capacity 
constraints on the space of possible fluxes in the metabolic network's reactions 
through a set of linear equations: 


‘Sx v=0 (1) 
Vnin S VS Vinax (2) 


where v stands for the flux vector for all of the reactions in the model (that is, the 
flux distribution). The exchange of metabolites with the environment is represented 
as a set of exchange (transport) reactions, enabling a pre-defined set of metabo- 
lites to be either taken up or secreted from the growth media. The steady-state 
assumption represented in equation (1) constrains the production rate of each 
metabolite to be equal to its consumption rate. Enzymatic directionality and flux 
capacity constraints define lower and upper bounds on the fluxes and are embed- 
ded in equation (2). In the following, flux vectors satisfying these conditions will be 
referred to as feasible steady-state flux distributions. The analyses were performed 
under the Roswell Park Memorial Institute Medium (RPMI)-1640m. We used the 
biomass function introduced in ref. 16. 

Predicting growth rate, metabolite production and flux distribution through 
metabolic modelling. To determine the relation between ASS1 activity, CAD activ- 
ity and growth rate, we used a generic human model and simulated the inactivation 
and activation of the reaction catalysed by ASS1. The inactivation was simulated 
by constraining the flux through the ASS1 reaction to zero, while the activation 
was simulated by enforcing increased positive flux through the ASS1 reaction up 
to the maximal possible flux, as computed via flux variability analysis!”. At each 
such point, the maximal growth rate was computed via flux balance analysis’’. 
Additionally, we estimated the flux through the reaction catalysed by CAD under 
maximal growth rate on the basis of 1,000 different feasible flux samples!®, 

We next used genome-scale metabolic models for each of the NCI-60 cancer 
cell lines on the basis of their gene expression measurements. In each cell-line 
model, we performed the following analyses. (1) We computed the production of 
each biomass component under both the inactivation and maximal activation of 
ASS1, as described above. The difference between the predicted production rates 
of each biomass component in the two states was then computed on the basis 
of the results of this optimization problem. (2) Similarly, we examined the flux 
change of each reaction under maximal biomass production in both the inactiva- 
tion and activation states, as described above. In each of these states, we sampled 
the solution space and obtained 1,000 feasible flux distributions!®. Focusing on the 
reactions associated with aspartate and glutamine, we computed the fold-change 
in flux rate together with its significance level. The latter was computed via a two- 
sided Wilcoxon rank-sum test. The largest fold-change among these reactions was 
predicted for the reactions catalysed by the CAD enzyme. 

TCGA data analysis. For each tumour, normalized gene expression levels meas- 
ured using RSEM'? were obtained from the RNASeqV2 data sets at the TCGA 
portal (https://tcga-data.nci.nih.gov/tcga/). Only matched tumour-normal pairs 
were used. For each tumour type, we computed the mean expression levels in the 
tumour and normal samples, added a pseudo-count of 1 to each mean and plotted 
the ratio between the means. 

Metabolomics analysis. Osteosarcoma or melanoma cell lines were seeded 
at 3 x 10° to 5 x 10° cells per 10 cm plate and incubated with either 4 mM 


L-glutamine, (a-!°N, 98%, Cambridge Isotope Laboratories) for 24h. Subsequently, 
cells were washed with ice-cold saline, lysed with 50% methanol in water and 
quickly scraped followed by three freeze-thaw cycles in liquid nitrogen. The insol- 
uble material was pelleted in a cooled centrifuge (4°C) and the supernatant was 
collected for consequent GC-MS analysis. Samples were dried under air flow 
at 42°C using a Techne Dry-Block Heater with sample concentrator (Bibby 
Scientific) and the dried samples were treated with 40 ul of a methoxyamine 
hydrochloride solution (20 mg ml”! in pyridine) at 37°C for 90 min while shak- 
ing followed by incubation with 70 ul N,O-bis (trimethylsilyl) trifluoroacetamide 
(Sigma) at 37 °C for an additional 30 min. 

GC-MS. GC-MS analysis used a gas chromatograph (7820AN, Agilent 
Technologies) interfaced with a mass spectrometer (5975 Agilent Technologies). 
An HP-5ms capillary column 30m x 250 um x 0.25 um (19091S-433, Agilent 
Technologies) was used. Helium carrier gas was maintained at a constant flow rate 
of 1.0 ml min~!. The GC column temperature was programmed from 70 to 150°C 
via a ramp of 4°C min™!, 250-215°C via a ramp of 9°C min“, 215-300°C via a 
ramp of 25°C min! and maintained at 300°C for an additional 5 min. The MS was 
by electron impact ionization and operated in full-scan mode from m/z= 30-500. 
The inlet and MS transfer line temperatures were maintained at 280°C, and the 
ion source temperature was 250°C. Sample injection (1 tl) was in splitless mode. 
Nucleic acid analysis. Materials. Ammonium acetate (Fisher Scientific) and 
ammonium bicarbonate (Fluka) of LC-MS grade were used. Sodium salts of AMP, 
CMP, GMP, TMP and UMP were obtained from Sigma-Aldrich. Acetonitrile of 
LC grade was supplied from Merck. Water with resistivity 18.2 MO. was obtained 
using Direct 3-Q UV system (Millipore). 

Extract preparation. The obtained samples were concentrated in speedvac to 
eliminate methanol, and then lyophilized tol dryness, re-suspended in 200 ul of 
water and purified on polymeric weak anion columns (Strata-XL-AW 100 um 
(30mg ml, Phenomenex)) as follows. Each column was conditioned by passing 
1 ml of methanol, then 1 ml of formic acid/methanol/water (2/25/73) and equili- 
brated with 1 ml of water. The samples were loaded, and each column was washed 
with 1 ml of water and 1 ml of 50% methanol. The purified samples were eluted 
with 1 ml of ammonia/methanol/water (2/25/73) followed by 1 ml of ammonia/ 
methanol/water (2/50/50) and then collected, concentrated in speedvac to remove 
methanol and lyophilized. Before LC-MC analysis, the obtained residues were 
re-dissolved in 100 ul of water and centrifuged for 5 min at 21,000 g to remove 
insoluble material. 

LC-MS analysis. The LC-MS/MS instrument consisted of an Acuity I-class 
UPLC system (Waters) and Xevo TQ-S triple quadruple mass spectrometer 
(Waters) equipped with an electrospray ion source and operated in positive 
ion mode was used for analysis of nucleoside monophosphates. MassLynx and 
TargetLynx software (version 4.1, Waters) were applied for the acquisition and 
analysis of data. Chromatographic separation was done on a 100mm x 2.1 mm 
internal diameter, 1.8-um UPLC HSS T3 column equipped with 50mm x 2.1mm 
internal diameter, 1.8-um UPLC HSS T3 pre-column (both Waters Acuity) with 
mobile phases A (10 mM ammonium acetate and 5 mM ammonium hydro- 
carbonate buffer, pH 7.0 adjusted with 10% acetic acid) and B (acetonitrile) at 
a flow rate of 0.3 ml min~! and column temperature 35°C. A gradient was used 
as follows: for 0-6 min the column was held at 0% B, then 6-6.5 min a linear 
increase to 100% B, 6.5—7.0 min held at 100% B, 7.0-7.5 min back to 0% B and 
equilibration at 0% B for 2.5 min. Samples kept at 8 °C were automatically injected 
in a volume of 3 ul. 

For mass spectrometry, argon was used as the collision gas with a flow of 

0.25 ml min. The capillary voltage was set to 2.90 kV, source temperature 150°C, 
desolvation temperature 350°C, desolvation gas flow 6501 min“. Analytics were 
detected using multiple-reaction monitoring and applying the parameters listed 
in Supplementary Table 3. 
Hybridizations and imaging. Single-molecule FISH (smFISH) was performed 
with probe libraries for Ass1 (74 probes, sequences described in Supplementary 
Methods) and Ki67 (96 probes”°). Imaging was performed as previously 
described”®, smFISH images were filtered with a Laplacian of Gaussian filter of 
size 15 pixels and standard deviation of 1.5 pixels. Each image is a maximum 
projection of ten stacks spaced 0.3 um apart in the z-direction. Each dot in these 
figures represents a cell and the quantification dots were counted on eight z-stacks 
spaced 0.3 um apart (total tissue volume 2.4 um). 

Proximity ligation assay. The assay was performed as published”! using Sigma 
Aldrich kit (DUO 92004-30-RXN). Antibodies used for detection were diluted in 
PBS: ASS1 (1:200, ab170952, abcam), citrin (1:100, H0O0010165-MOl1, clone 4F4, 
abnova) and anti-CAD (1:100, ab40800, abcam). 

Cell cultures. All cell lines were authenticated. Melanoma cell lines LOX IMVI and 
MALME-3m and osteosarcoma cell lines MNNG/HOS and U20S were purchased 
from American Type Culture Collection (ATCC) and cultured using standard 
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procedures in a 37°C humidified incubator with 5% CO in RPMI (Invitrogen) 
supplemented with 10-20% heat-inactivated fetal bovine serum, 10% pen-strep 
and 2 mM glutamine. All cells are tested routinely for mycoplasma using a 
Mycoplasma EZ-PCR test kit (20-700-20, Biological Industries). 

Proliferation assays. MTT assay. Cells were seeded in 12-well plates at 4 x 10‘ to 
8 x 10° cells per well in a triplicate. After 6h for adherence of the cells, 0.1 mg ml 
of MTT (3-(4,5-dimethylthiazol-2-yl)-2,5 diphenyltetrazolium bromide) (CAS 
298-93-1, Calbiochem) in PBS was added to each cell type, starting at 0h, in 24h 
intervals. Deoxynucleotide Set (DNTP100-1KT, Sigma-Aldrich) was added to the 
cells’ medium first after adherence and then daily at a final concentration of 10 uM. 
Cells were lysed with dimethylsulfoxide (DMSO). Absorbance was measured at 
570nm. 

Crystal violet staining. Cells were seeded in 12-well plates at 40,000-100,000 

cells per well in a triplicate. Time 0 was calculated as the time the cells became 
adherent, which was after about 6h from plating. For each time point, cells were 
washed with PBS X1 and fixed in 4% PFA (in PBS). Cells were then stained with 
0.1% Crystal Violet (C0775, Sigma-Aldrich) for 20 min (1 ml per well) and washed 
with water. Cells were then incubated with 10% acetic acid for 20 min with shaking. 
Extract was then diluted 1:4 in water and absorbance was measured at 590 nm 
every 24h. 
Protein and RNA analysis. Western blotting. Cells were lysed in RIPA (Sigma- 
Aldrich) and 0.5% protease inhibitor cocktail (Calbiochem). After centrifuga- 
tion, the supernatant was collected and protein content was evaluated by the 
Bradford assay. One hundred micrograms from each sample under reducing 
conditions were loaded into each lane and separated by electrophoresis on a 
10% SDS polyacrylamide gel. After electrophoresis, proteins were transferred to 
Immobilon transfer membranes (Tamar). Non-specific binding was blocked by 
incubation with TBST (10mM Tris-HCl (pH 8.0), 150 mM NaCl, 0.1% Tween 
20) containing 3% albumin from bovine serum for 1 h at 25°C. Membranes were 
subsequently incubated with antibodies against ASS1 (1:500, sc-99178, Santa 
Cruz Biotechnology)’, p97 (1:10,000, PA5-22257, Thermo Scientific), GAPDH 
(1:1,000, 14C10, 2118, Cell Signaling)”*, CAD (1:1,000, ab40800, abcam)”*, 
phospho-CAD (Ser1859) (1:1,000, 12662, Cell Signaling)'’, p70 $6 Kinase 
(1:1,000, 9202, Cell Signaling) and phospho-p70 S6 Kinase (Ser371) (1:1,000, 9208, 
Cell Signaling)”*. Antibody was detected using peroxidase-conjugated AffiniPure 
goat anti-rabbit IgG or goat anti-mouse IgG (Jackson ImmunoResearch) and 
enhanced chemiluminescence western blotting detection reagents (EZ-Gel, 
Biological Industries). 

Gels were quantified by Gel Doc XR+ (BioRad) and analysed by ImageLab 
4.1 software (BioRad). The band area was calculated by the intensity of the band. 
The obtained value was then divided by the value obtained from the loading 
control. 

RNA extraction and complementary DNA (cDNA) synthesis. RNA was extracted 
from cells by using PerfectPure RNA Cultured Cell Kit (5’-PRIME). cDNA was 
synthesized from 1 ug RNA by using qScript cDNA Synthesis Kit (Quanta). 

Quantitative PCR. Detection of ASS1 on cDNAs (see above) was performed using 
SYBR green PCR master mix (Tamar) and the required primers. Primer sequences 
were as follows. Human ASS1: forward, 5‘-TTATAACCTGGGATGGGCACC-3’; 
reverse, 5‘-TGGACATAGCGTCTGGGATTG-3’. Human HPRT: forward, 
5’-ATTGACACTGGCAAAACAATGC-3’; reverse,: 5/-TCCAACACTTCG 
TGGGGTCC-3’. Analysis used StepOne real-time PCR technology (Applied 
Biosystems). 

Transient transfection. Cells were seeded in 12-well plates at 30,000 cells per well, 
or in 10cm plates at 10° cells per plate, in triplicate. The following day, cells were 
transfected with either 20 pmol or 600 pmol siRNA siGenome SMARTpool targeted 
to Citrin mRNA (M-007472-01, Thermo Scientific), respectively. Transfection 
was performed with Lipofectamine 2000 Reagent (11668-019, Invitrogen) in the 
presence of Opti- MEM I Reduced Serum Medium (31985-062, Invitrogen). Four 
hours after transfection, medium was replaced and experiments were performed 
starting 24h after transfection. 

Infection. Over-expression. Cells were infected with pLenti3.3/TR and 
with pLenti6.3/TO/V5-DEST-based lentiviral vector with or without the 
human ASS1 transcript. Transduced cells were selected with 1 mg ml! Geneticin 
and with 7.5 ug ml“! Blasticidin for each plasmid, respectively. When induction 
of expression was needed, cells were added with 101g ml! tetracycline/ 
doxycycline. 

Short hairpin RNA. Cells were infected with pLKO-based lentiviral vector with 
or without the human ASS1 short hairpin RNA (shRNA) encoding one or two 
separate sequences combined (RHS4533-EG445, GE Healthcare, Dharmacon). 
Transduced cells were selected with 2 ug ml! puromycin. 

Arginine deprivation combined with drug treatments. U2OS human osteosar- 
coma cell line was seeded in 6-well plates at 80,000 cells per well. The following 
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day, cells were treated with either 100nM rapamycin (R0395, Sigma-Aldrich) or 
with 10 uM 5FU (F6627, Sigma-Aldrich) in regular medium, with 10% dialysed 
FCS-arginine-free-RPMI (06-1104-34-1A, Biological Industries) or with both 
arginine-depleted medium and one of these drugs. Rapamycin and 5FU were 
renewed into the medium every day, whereas fresh arginine-free medium was 
supplemented twice a week. 

Animal studies. According to the approved IACUC protocol 17270415-2, 
tumours did not exceed the limits of more than 10% of the animal weight 
and were not longer than 1.5m in length in any dimension (Supplementary 
Fig. 2). Ten million MALME-3m melanoma cells suspended in 500 pl PBS with 5% 
Matrigel (4132053 Corning) were injected subcutaneously to 8- to 12-week-old 
male SCID mice that were purchased from Harlan. There were 22 SCID mice, 
from which 5 or 6 were used for each cell line in each of the three experiments 
performed. No randomization was used. Mice were monitored for survival and 
tumour burden twice a week by a veterinarian investigator who was blinded 
to the expected outcome. Tumours were measured using a calliper. After 
euthanization, tumours were removed and incubated in medium containing 
[SN] glutamine for 6 h followed by GC-MS analysis. Tumour size was calculated 
as published”®. 

Building cell models. We used genome-scale metabolic models of NCI- 
60 cancer cell lines. The reconstruction method (on the basis of methods 
termed PRIME!” requires several key inputs: (1) the generic human model’; 
(2) gene expression data for each cell line!’; and (3) growth rate measurements 
(available at the NCI website: https://dtp.cancer.gov/discovery_development/ 
nci-60/cell_list.htm). The algorithm then reconstructs a specific metabolic model 
for each sample by modifying the upper bounds of reactions in accordance with 
the expression of the individual gene microarray values. 

Specifically, the model reconstruction process is as follows. (1) Decompose 
reversible reactions into unidirectional forward and backward reactions. (2) 
Evaluate the correlation between the expression of each reaction in the network 
and the measured growth rate. The expression of a reaction is defined as the mean 
over the expression of the enzymes catalysing it. (3) Modify upper bounds on 
reactions demonstrating significant correlation to the growth rate (after correct- 
ing for multiple hypothesis using false discovery rate) in a manner that is linearly 
related to expression value. 

Ass1 probes. Gcgcatccagggttataagc; gcggagcaggectgagagag; gcccaggcegcagtgg 
caga; geccagatgaaccactcagt; agtccgtgtagtctgctttc; gattataggtacagegtccct; ccttgctg 
gacatcttgtct; ctgtaggccagaaccacaga; gcaggagetetccaggccac; gttccttcagccacacgagg; 
taggcgatgacatcatagcc; cttctggccaatgttggcca; tcctggcttcctcaaagtct; gccccaagcttcag 
cgcctt; atcctcaatgaacacctttt; cttccacaaattccttgctc; tegacagcaggccagatgaa; gtcctcgta 
gagtgcactgg; gagagetgcccaggagatag; cgagctatgcaagecctggc; ctgggcaatctccacct 
gtc; acacatacttggccccttca; ccctttcccgtggcegccetg; ctcaaagcggacctggtcat; gtgccagt 
gaatagcaggte; ggagcgatgaccttaatctg; tgtaaaactcaggcatcctc; atcatttcggcccttgaacc; 
gttgctttgcatactccatc; gtgacagggatgeeeattcc; catactccaggsectcttgs; tgatgtgcatgag 
gttttca; tccaggatcccagcctcata; aggtgcttgattcttggest; gagtttttgtgtagagaccc; tigg 
gtgctttggcaggetc; tatctcaaggacatctgggc; cagggaccccttttttgaat; tctttgatgttgetcacctt; 
ggatgtggtecegsttgtgc; tcaggtacatgaagagttcc; ccgtgcttgcccgcaacttc; cacgatgtcaatg 
cgaccca; tcattccaatgaagcggttc; gtctcgtagatacctcggga; gtaaaggatgestccctgctg; cctc 
tatgtctaaatgagcg; acttcccgatccatcgtgaa; caggccctgcttgatt; cgagctctgcgaatttgage; 
ctgtgccagaaacctgtgta; gcgaacaaattcacattcag; cctgggacttctggatacag; tgcaccttcccttc 
tacccg; ttggcccttgaagacagaca; actcccgaccgaggatgtac; tcattgtagagtgaaagtge; tgcacgt 
tcatgctcaccag; gtcgatgsectcatagtcgc; tgatattgatgaagccagtg; tactccttcagcctgagcga; 
gaccttgctctgaagecgat; ttgtcaggetctatttggca; gastggegagecccectcct; gctgaagcctggga 
gagctg; caaatttatcacaacaatta; getegagaacaagctacaat; gacacagcagccccagtcag; aggctgt 
gegeeseceseeo: sctatagggeaccaggeaac; cctiggatgaccacttttgt; agctgcccgccaccctccct; 
attgtcattttatgctttct; aagactaatgtaacttcttt. 

Statistics. All statistical analyses were performed using Tukey’s honest significant 
difference test or independent-samples Student's t-test of multiple or two groups, 
respectively. Log-transformed data were used where differences in variance were 
significant and variances were correlated with means. The sample size was chosen 
in advance on the basis of common practice of the described experiment and is 
mentioned for each experiment. No statistical methods were used to predeter- 
mine sample size. Each experiment was conducted with biological and technical 
replicates and repeated at least three times unless specified otherwise. On the 
basis of pre-established criteria, individual outlier data points that were more than 
2 standard deviations away from the mean were excluded from the data analysis. 
Statistical tests were done using Statsoft’s STATISTICA, version 10. All error bars 
are standard errors. P < 0.05 was considered significant in all analyses (*P < 0.05, 
**P < 0,005, ***P < 0.0005). 

Kaplan-Meier. For each cancer type, the Kaplan-Meier plot indicates the sur- 
vival rates of the four different groups of patients as labelled. We analysed the 
cancer types for which there were sufficient survival data. 
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Extended Data Figure 1 | ASS1 deficiency correlates with aspartate 
utilization by CAD in cancerous and non-cancerous cells. 

a, Schematic flux tracing of the a-labelled nitrogen of glutamine ([°N] 
a-glutamine) to nucleic acid synthesis via aspartate. b, The ratio between 
M+ 1-labelled and total uracil in fibroblasts is similar between patients 
with citrullinaemia patients and control subjects ( > 3). Error bars are 
standard error. c, Labelled levels of M+ 1 aspartate (left) and M + 1 uracil 
(right) synthesized from [‘*N]a-glutamine are higher in fibroblasts from 
CTLN I than in fibroblasts from controls and patients with CTLN II 

(n> 3). d, TCGA analysis of tumour-normal paired tissues for gene 
expression comparison shows the expression levels of ASL and ASS1 

in different cancers. e, Plot generated from the modelling data for the 
production capacity of metabolites after ASS1 inactivation in each of 


the NCI-60 cell lines as well as in the generic model. The reddish bars 
represent the ranking of nucleic acids while the blueish bars represent the 
ranking of all other metabolites. f, Correlation analysis of NCI-60 cell lines 
shows a significant inverse correlation between ASS] and CAD expression 
levels. g, Osteosarcoma (upper) and melanoma (lower) microarray data 
was obtained from the NCBI EO database (accession numbers GSE33383 
and GSE46517, respectively). Raw expression levels were plotted and 
significance was computed using a t-test on log,-transformed expression 
levels. The number of patients for each subtype is shown in parenthesis 

on the left. h, Western blot for CAD and ASS1 shows higher expression 
levels of CAD in the MNNG/HOS human osteosarcoma cell line, which 
has a low expression level of ASS1 compared with U20S, which has higher 
expression levels of ASS1; p97 is shown as loading control. 
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Extended Data Figure 2 | ASS1 inactivation in melanoma correlates 
with increased proliferation. a, An immunoblot showing different 
expression levels of ASS1 and CAD in two different cancer cell lines 

of melanoma. b, Melanoma cells with ASS1 downregulation have a 
significant increase in pyrimidine levels as measured by LC-MS (n > 3). 
c, Melanoma cells with ASS1 downregulation have a significant increase 
in total uracil (n= 4). d, Melanoma cells with ASS1 downregulation 
have a significant increase in proliferation as measured by MTT 

assay (n = 2). e, Immunoblots of melanoma cells for ASS1 levels after 
transduction with either ASS1 over expression construct or with shASS1. 
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f, Proliferation assays showing a significant decrease in proliferation 
after ASS1 overexpression in melanoma using MTT (n=3). g, Crystal 
violet quantification for melanoma cells after transduction with shASS1 
demonstrating increase in proliferation (n = 3). h, LC-MS measurements 
of pyrimidine levels showing a significant increase after the use of 
shASS1 in melanoma cells (n > 3). i, Left: total uracil levels are decreased 
significantly in melanoma cells with ASS1 overexpression and increased 
in melanoma cells with shASS1 (right) (n > 2). j, Significant increase in 
proliferation of melanoma cells by dNTPs after ASS1 overexpression 
(n= 3). All error bars are standard errors. 
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Extended Data Figure 3 | Downregulation of ASS1 levels increases 
pyrimidine synthesis. a, Osteosarcoma cells were transduced with two 
different shASS1 vectors: shASS742 and shASS745. Both clones decreased 
ASS] levels efficiently to approximately 20% expression (left), resulting in 
a significant increase in uracil M+ 1 levels (b) and in proliferation (n > 3) 
(c). d, RNA levels measured in U2OS at 24 h intervals show increased 
levels of RNA in U2OS infected with shASS1 compared with the empty 


vector. e, Uracil M+ 1 levels increase more in U2OS infected with shASS1 
compared with the empty vector during 38 h of measurements. f, The 
levels of total and labelled M + 1 alanine synthesis from [‘°N]a-glutamine 
do not change significantly after ASS1 downregulation (n = 3). g, Tumours 
with shASS1 had higher levels of M+ 1 aspartate (left) and M+ 1 uracil 
(right) synthesized from ['*N]a-glutamine, compared with tumours 
expressing the empty vector (n= 15). All error bars are standard errors. 
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Extended Data Figure 4 | Cancers with ASS1 downregulation are 
addicted to aspartate. a, b, Analysis of the TCGA database of matched 
tumour-normal pairs showing no significant difference in the expression 
level of citrin in tissues with a high baseline expression of citrin (a) and 
significant elevation in tumours in which the normal tissue has low 

basal expression of citrin (b) (* P< 0.001). c, Immunoblot showing 

the expression level of citrin in osteosarcoma cells after si-citrin. 

d, Labelled and unlabelled aspartate (top) and uracil (bottom) are elevated 
significantly in cancers with ASS1 downregulation and are comparable 

to control in cells with both ASS1 and citrin downregulation (n > 3). 
Error bars are standard errors. e, Kaplan-Meier survival analysis for two 
different cancer types (BRCA, breast cancer (top); LUSC, lung squamous 
cell carcinoma (bottom)), showing poor survival trend for cancers with 
low ASS1 and high citrin. For each cancer type, the Kaplan-Meier plot 
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indicates the survival rates of four groups of patients: (1) ASS1 low 
expression and citrin high expression; (2) ASS1 low expression; (3) citrin 
high expression; (4) none of these. We analysed the cancer types for which 
there were sufficient survival data. f, Quantification graph of a western 
blot showing decreased CAD and S6K phosphorylation after treatment 

of U20S with si-citrin. Error bars are standard errors. g, Proximity 
ligation assay showing increased proximity between CAD and citrin 

after ASS1 knockdown in U2OS cells (top, red dots). The left and middle 
pictures show the proximity between ASS1 and CAD to citrin in U2OS 
infected with empty vector, whereas the right picture shows the proximity 
between CAD and citrin after infection of U2OS with shASS1. Bottom: 
quantification of proximity ligation assays performed on U2OS infected 
with either empty vector (EV) or with shASS1 using antibodies for citrin, 
ASS1 and CAD. The pictures were quantified using Image]. 


© 2015 Macmillan Publishers Limited. All rights reserved 


LETTER 
Extended Data Table 1 | ASS1 inactivation is predicted to increase aspartate flux for nucleic acid synthesis 


Pyrimidine Biosynthesis (790.1), CAD | | 847-198 
IMP Biosynthesis (10606.1) PAICS <1e-300 


. (159.1) Adenylosuccinate 
Nucleotides 
synthase 1.55E-265 


Predicted fold-change in flux rates through pathways associated with aspartate and glutamine, when comparing ASS1 inactivation with activation state. The most significant change is predicted to 
effect the pyrimidine biosynthesis pathway followed by purine synthesis pathway (two-sided Wilcoxon rank-sum P value <8.4 x 10-198, Methods). 
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Extended Data Table 2 | Kaplan-Meier log-rank data analysis shows significant worsening in the survival of patients with low ASS1 and high 
citrin expression levels in bladder cancer and lung adenocarcinoma 


Number of patients ASS1lowonly Citrinhighonly None 
ASS1 low Citrin high 26 0.062029244 0.41802416 
ASS1 low only 437 0 0.939396939 0.133262071 
Citrin high only 83 0 0 0.397997761 
None 415 

Number of patients ASSilowonly Citrinhighonly None 
ASS1 low Citrin high 25 0.091470414 0.348538412 0.215689269 
ASS1 low only 76 0 0.513641931 0.247876216 
Citrin high only 14 0 0 0.784222648 
None 90 

Number of patients ASS1lowonly§ Citrinhighonly None 
ASS1 low Citrin high 34 0.086250611 
ASS1 low only 16 0 0.989469401 0.778260123 
Citrin high only 181 0 0 0.440612674 
None 189 

Number of patients ASS1lowonly Citrinhighonly None 
ASS1 low Citrin high 56 0.889254024 0.131597923 0.271250284 
ASS1 low only 35 0 0.234416519 0.540649678 
Citrin high only 173 0 0 0.710240332 
None 127 
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The table shows the number of patients for whom data were available in each group, as well as the pairwise P value of comparison between the corresponding groups: BLCA, BRCA, LUAD, LUSC. 


LETTER 


doi:10.1038/nature15518 


DNA-dependent formation of transcription factor 
pairs alters their binding specificity 


Arttu Jolma!, Yimeng Yin!, Kazuhiro R. Nitta!, Kashyap Dave!, Alexander Popov’, Minna Taipale!, Martin Enge!, Teemu Kivioja’, 


Ekaterina Morgunova! & Jussi Taipale! 


Gene expression is regulated by transcription factors (TFs), proteins 
that recognize short DNA sequence motifs'*. Such sequences 
are very common in the human genome, and an important 
determinant of the specificity of gene expression is the cooperative 
binding of multiple TFs to closely located motifs*®. However, 
interactions between DNA-bound TFs have not been systematically 
characterized. To identify TF pairs that bind cooperatively to DNA, 
and to characterize their spacing and orientation preferences, 
we have performed consecutive affinity-purification systematic 
evolution of ligands by exponential enrichment (CAP-SELEX) 
analysis of 9,400 TF-TF-DNA interactions. This analysis revealed 
315 TF-TF interactions recognizing 618 heterodimeric motifs, 
most of which have not been previously described. The observed 
cooperativity occurred promiscuously between TFs from diverse 
structural families. Structural analysis of the TF pairs, including a 
novel crystal structure of MEIS1 and DLX3 bound to their identified 
recognition site, revealed that the interactions between the TFs were 
predominantly mediated by DNA. Most TF pair sites identified 
involved a large overlap between individual TF recognition motifs, 
and resulted in recognition of composite sites that were markedly 
different from the individual TF’s motifs. Together, our results 
indicate that the DNA molecule commonly plays an active role in 
cooperative interactions that define the gene regulatory lexicon. 

The set of rules by which a DNA sequence can be converted into 
knowledge of spatial and temporal expression patterns of a protein 
has been difficult to decipher! ?. This is in part because, in mammals, 
more than 1,000 TFs recognizing over 200 different short DNA 
motifs participate in interpreting gene regulatory information”~’®. In 
addition, TFs also interact with each other, and many TFs bind DNA as 
homo- or heterodimers. A pair of TFs can bind to multiple different 
DNA motifs, as the recognition sites of individual TFs can occur in 
different orientations and/or spacings relative to each other’!~!°. Most 
known heterodimeric interactions occur between two TFs of the same 
structural family, but several cases where TFs of different structural 
classes bind cooperatively have also been identified*”. 

To chart the prevalence of co-operative interactions between TFs 
in the presence of DNA, we developed a novel method, CAP-SELEX, 
in which specific DNA sequences that interact with two different TFs 
at the same time are selected from a library of random sequences. 
CAP-SELEX only detects a specific type of interaction involving three 
macromolecules, where the two tested proteins both bind to the same 
DNA ina sequence-specific manner (Fig. 1a). Compared to existing 
methods such as SELEX-seq!* and universal protein binding micro- 
arrays”!°, CAP-SELEX respectively allows higher throughput and inter- 
rogation of larger sequence space. A total of 100 streptavidin-binding 
peptide (SBP) tagged TF1 proteins were used in the assay together with 
94 (3 x Flag-tagged) TF2 proteins to characterize 9,400 potential inter- 
actions (Fig. la and Supplementary Table 1). TFs were selected to cover 
a wide variety of structural classes and individual binding specificities. 


To allow bacterial expression, unstructured regions and amino- and 
carboxy-terminal sequences that do not correspond to known protein 
domains were removed from the constructs. 

Co-operative signals were detected from CAP-SELEX enriched 
sequences (Extended Data Fig. 1) either as novel composite sites that 
combine partial specificities of the two TFs (using Autoseed'*), or as 
orientation and spacing preferences between the individual TF’s motifs 
(Supplementary Table 2). This analysis revealed that 55% (55/100) of 
TF1 and 70% (66/94) of TF2 proteins were ‘active’ in the CAP-SELEX 
assay, as indicated by identification of the expected pair of monomeric 
sites in at least five experiments, or a heterodimeric site in at least one 
experiment (Extended Data Fig. 2 and Supplementary Table 1). To test 
reproducibility of the assay, 10 TF1 proteins were run again against all 
TE2 proteins. In most cases, the recovered motifs were very similar. 
In addition, we validated 10 TF-TF pairs using purified full-length 
proteins (Extended Data Fig. 3). 

Of the 3,630 tested interactions between active TF1-TF2 pairs, 315 
(8.7%) displayed cooperative binding. This result is likely an under- 
estimate, as enrichment of both expected motifs was not observed in 
many cases (83% of all tested pairs, not shown). The interactions were 
not limited to those between related TFs, and also occurred commonly 
between TFs from different structural families. Only 5% of all active 
TF1 and TF2 pairs appeared to bind to DNA independently of each 
other, as indicated by the presence of both expected motifs without 
strong orientation and spacing preferences (Extended Data Fig. 2). 

Of the interacting TF pairs, 162 had only one preferred site, whereas 
153 pairs displayed more than one spacing and/or orientation 
(Fig. 1b, c). Analysis of pairs of motifs that occurred in the same orien- 
tation revealed that the stringency of their spacing was dependent on 
the motif-to-motif distance. Most TF pairs whose motifs overlapped 
preferred just one (negative) spacing between the motifs. In contrast, if 
the most enriched motif pair had a gap, two or more spacings were more 
commonly observed. Most longer-range interactions where the gap 
between the motifs was 3 base pairs (bp) or more displayed a relatively 
wide, +2 bp, spacing preference, similar to that reported previously!” 
(Fig. 1d and Extended Data Fig. 4). Many TF pairs displayed both kinds 
of interactions, with one orientation preferring stringent short-range 
interactions, and the other orientation(s) preferring the more relaxed 
long-range interactions (not shown). 

In cases where two or more motif spacings were allowed in the same 
orientation, the differences between the observed motif-to-motif 
distances were in general very small (Fig. le, >73% were only 1 bp), 
indicating that the mechanism of the TF-TF cooperativity is very sen- 
sitive to the relative distance and/or angle between the bound TFs. The 
promiscuous nature of the cooperativity was highlighted by the fact 
that most cases where more than one preferred mode of binding was 
observed involved multiple motif orientations (Fig. 1f). Two orienta- 
tions was most common, whereas fewer cases of three or four orien- 
tations were observed, in part because pairs with one or two TFs with 


1Department of Biosciences and Nutrition, Karolinska Institutet, SE 141 83, Sweden. European Synchrotron Radiation Facility, 38043 Grenoble, France. ?>Genome-Scale Biology Program, 


University of Helsinki, P.O. Box 63, Fl-00014, Finland. 
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Figure 1 | CAP-SELEX reveals DNA-mediated TF-TF interactions. 

a, Schematic description of CAP-SELEX. A TF1-TF2-DNA complex is 
formed (top left) and subjected to two consecutive affinity purifications, 
followed by amplification of DNA and sequencing. The entire process is 
repeated three times, and the cooperative complexes are then detected 
from the sequences. CAP-SELEX derived PWM models for the indicated 
previously known?°*8 TF complexes are also shown. b, An example of a 
TF-TF pair preferring a single spacing and orientation. Heatmap shows 
counts (divided by max) of representative 6-mers for the TFs. c, A TF 
pair preferring two different orientations, with relatively flexible spacing 
in both orientations. Logos for the strongest cases are also shown. 

d, High stringency of closely packed TF-TF sites. The pie charts show the 
number of allowed spacings in a single orientation, binned according to 
the motif-to-motif distance (gap) of the strongest-bound site (maximum). 
e, TFs that can bind to sites with multiple spacings prefer very closely 
spaced sites. Histogram shows difference in gap length between the most 
strongly enriched motif spacing (normalized to 0) and other identified 
motif spacings. f, Most TF-TF pairs with multiple cooperatively bound 
sites allow more than one orientation. Pie chart shows frequency of TF-TF 
pairs binned according to the number of allowed orientations. Only pairs 
with multiple preferred motifs are included. 


palindromic recognition sequences can only have two or one orienta- 
tions, respectively. These results indicate that TF-TF cooperativity is 
widespread and not just mediated by the highly specific protein-protein 
interactions observed in previously described canonical heterodimers. 
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Figure 2 | Overlapping composite TF motifs with novel specificity. 
a, An example of a TF pair binding to an overlapping composite site. 
Top, a composite GCM1-ELK1] logo aligned to the individual logos. 
Middle, DNA-protein contacts for GCM1 (purple) and ELK1 (light blue) 
in the composite site, predicted based on GCM1-DNA and ELKI-DNA 
structures*?°, Dots indicate the number of hydrogen bonds between the 
TF and DNA backbone (ovals) and bases (rectangles) occurring via major 
(filled circle) and minor (open circle) grooves. Bottom, a schematic model of 
GCM1-ELK1 heterodimer. DNA is shown as idealized B-DNA. b, TFs prefer 
to bind to sites where their core motifs are closely spaced. Histogram of gaps 
observed between all full width motifs (core plus flank) and core motifs. 
Gap widths were counted for all TF pairs identified in this study for which 
structural data was available. Examples of calculating distance using full 
PWMs (above x axis) and core motifs (below x axis) are also shown. 


Some TF pairs displayed strong orientation and spacing preferences, 
without major changes in either motif (Fig. 1b, c). However, in a large 
number of cases (207), the specificity of the pair of TFs differed 
markedly from that expected from the individual motifs (Fig. 2a). These 
differences were observed when the two TFs were close to each other. To 
understand the mechanism of the altered specificity, we analysed avail- 
able structures for the studied TFs and their paralogues (Supplementary 
Data Set 2). Based on the analysis, 95% of the complexes are consistent 
with either a completely DNA-mediated mechanism, or a DNA- 
facilitated mechanism, where the interaction between the proteins is 
scaffolded by DNA and limited to few amino-acid contacts; only 5% 
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Figure 3 | All identified TF-TF interactions. a, PWM motif similarities 
between the heterodimer motifs (green bars) and monomeric and 
homodimeric representative motifs from ref. 8. Barcode logos for each 
factor are shown, and background colour of name indicates TF structural 
family. Center of dendrogram shows comparison of sequence and barcode 
logos, the colour key, and the number of all PWMs and representative 
PWMs. Inset (bottom right), fold enrichment of matches of the motifs 

in known TE clusters from human colon cancer cells!’. b, A network 
representation of the very similar heterodimeric sites formed between 
multiple FOX and ETS proteins. Note that a similar site is recognized when 


of the complexes appeared to form extensive protein-protein inter- 
action surfaces (Extended Data Fig. 5). We next generated position 
weight matrices (PWMs) that included information about hydrogen 
bond contacts between the TF amino acids, and DNA bases and 
backbone (Supplementary Table 3). Alignment of pairs of such 
contact-annotated PWMs to their respective composite models 
revealed that the changes in binding specificity mostly affected base 
positions that are recognized by the TFs via contacts to the DNA back- 
bone. In contrast, ‘core’ bases directly read via hydrogen bonds were 
rarely affected (Fig. 2b and Extended Data Fig. 6). 

In total, we recovered 618 PWM models describing the 
specificities of the TF-TF pairs. To globally analyse the collection, 
we identified distinctly different ‘representative’ motifs from a 
combination of our data set and previous high-throughput SELEX 
(HT-SELEX) data®. Out of all representative motifs, 61% were TF 
pair motifs identified in this study (Fig. 3 and Extended Data Fig. 7), 
suggesting that a very large fraction of TF specificity space is 
defined by TF heterodimers. A dendrogram displaying the similar- 
ity of the heterodimeric motifs revealed that paralogous proteins 
often shared the same partners, and bound to similar heterod- 
imeric motifs (Fig. 3a). A total of 63 of such motif groups were 
identified, representing 239 TF pairs. For example, many ETS factors 
formed complexes with forkhead proteins and with the posterior 
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the FOX protein is either used as TF1 (FOXO1, FOXJ2) or TF2 (FOXI1). 
Similar conserved motif is also shown”®. c, HOXB2-PAX1, HOXB2-PAX5 
and HOXB2-PAX9 heterodimer sites are similar to a site for PAX3, which 
contains both Pax- and homeodomains. d, FOXO1-HOXB13 and 
MEIS1-HOXB13 heterodimers validated by ChIP-exo. e, Binding of 
TEAD4 together with the indicated ETS TFs makes their divergent 
flanking recognition sequences (left box) more similar to each other 
(right box). f, HOXB2 reveals latent specificity of the TFs indicated. Inset 
shows conserved genomic motif” that is similar to the ELK1-HOXB2 
motif. 


homeodomain TFs HOXD12 and HOXB13, binding to highly simi- 
lar composite sites (Fig. 3b and Supplementary Table 2). Furthermore, 
PAX proteins containing only a paired domain interacted with 
homeodomain-containing partners, binding to sites that were similar 
to those recognized by PAX proteins that include both paired domains 
and homeodomains. This suggests that this site predates the joining 
of the paired domain and homeodomain to the same gene (Fig. 3c). 
HOXB13 also formed complexes with forkhead and MEIS pro- 
teins. The motifs recovered using CAP-SELEX were also enriched by 
HOXB13 ChIP-exo!®. The preferred dimer partner of HOX13 was cell- 
line specific, suggesting that TFs dimerize with different proteins in 
different cell types (Fig. 3d). The inclusion of HOXB13 and MEIS1 
dimer motifs also improved prediction of the corresponding ChIP- 
seq peak positions (Extended Data Fig. 5 and Supplementary Table 4). 
The novel motifs were enriched in ChIP-seq-identified TF cluster 
sequences” (Fig. 3a); a larger fraction (52%) of the novel motifs were 
enriched compared to the monomer motifs (34%, Supplementary 
Tables 2 and 4). Furthermore, comparison to motifs discovered 
de novo” and our independent analysis revealed that many (24%) 
of the motifs were enriched in mammalian conserved sequences 
(Supplementary Table 2 and Extended Data Figs 5 and 8). A total of 390 
of 618 motifs were found enriched in human TF clusters and/or mam- 
malian conserved sequences. Both of these methods have a relatively 
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Figure 4 | Structural validation of TF-TF interactions. a, Positions of 
MEIS1 partner TFs in relation to the 7 bp MEIS1 motif® (cyan, orientation 
NTGACAN). Note that the partners bind to different positions, spanning 

a 26-bp region. Modelling suggests that in all pairs except MEIS1—ELF1, the 
proteins do not interact extensively (Supplementary Data Set 2). Asterisks 
indicate that the corresponding genomic motif matches are conserved in 
mammals (see Supplementary Table 2). b, Structure of MEIS1-MEIS1-DNA 
complex. Top, Heatmap based on HT-SELEX data® showing occurrence of 
MEIS1 subsequence TGACA in the orientations indicated (arrowheads, 
scale divided by highest count). The preferred spacing and orientation 

is indicated by yellow outline. Bottom, structure of two MEIS1 proteins 


high false-negative rate, due to differences in dimers in different cell 
types, and the requirement that >50 motif occurrences are conserved 
to reach statistical significance. These results indicate that motifs 
identified in this study are biologically relevant. 

Heterodimeric partners could also mask differences in binding 
specificities of individual TFs. For example, class I, II and HI ETS 
factors ELK1, ELF1 and SPIB, respectively, prefer different 5’ flank 
sequences®”', but this difference is effectively masked by TEAD4 
(Fig. 3e). This effect was rare; only two other similar cases were iden- 
tified (Supplementary Data Set 1). Conversely, partners could be iden- 
tified that revealed ‘latent specificity’! of TFs, defined as binding of 
TFs to different heterodimeric sites, even when their primary specif- 
icities are indistinguishable. For example, ETV1, ET V4, ELK1 and 
ELK3 bind to similar monomeric sites, and also bound to similar het- 
erodimeric sites with GCM1 (Supplementary Table 2). However, with 
HOXB2, ETV1 and ET V4 bound to one type of site, and ELK1 and 
ELK3 to another site (Fig. 3f). The ET Vs are more closely related to 
each other than to the ELKs, suggesting that latent specificity evolves 
faster than primary specificity. Four other similar cases were identified 
(Supplementary Data Set 1). 

To analyse the mechanisms of cooperativity, we studied all identified 
dimers that included the TALE-class homeodomain MEIS1. Twelve TFs 
from six TF families bound to diverse but specific positions at either 
side of MEIS1 (Fig. 4a). To understand the basis of such interactions, we 
solved the structure of MEIS1 bound to DNA alone, as a homodimer, 
and as a heterodimer with DLX3 using X-ray diffraction (3.5 A, 1.6A 
and 3.5A resolutions, respectively). In the homodimer structure, the 
two monomers are on opposite sides of DNA and do not contact each 
other, indicating that the observed cooperativity is entirely mediated 
by DNA (Fig. 4b). 

Interaction between MEIS1 and DLX3 in CAP-SELEX is much 
stronger than that observed for the MEIS1-MEIS1 dimer®. The proteins 
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bound to the preferred site. TGACA and its reverse complement sequence 
are in cyan. c, Structure of MEIS1-DLX3-DNA complex. Top, heatmap 
showing occurrence of MEIS1 and DLX3 5-mer subsequences TGACA 
(black arrowhead) and AATTG (red arrowhead), respectively. Bottom, 
crystal structure of MEIS1 and DLX3 bound to the preferred site. Note the 
narrowing of the DNA minor groove between the two proteins. d, DLX3 
Arg132 (orange) inserts into the minor groove of DNA, and positions two 
adjacent serines and a tyrosine (yellow) so that an aspartate from MEIS1 
hydrogen bonds (dotted lines) with DLX3 peptide backbone. Bottom, 
conservation of the residues in homeodomain proteins. Residues conserved 
in all human DLX proteins are in bold. 


interact, but the contact surface is very small, covering only 2.0% of the 
solvent accessible surface of the dimer. However, the DNA between 
the proteins is significantly deformed, narrowing the minor groove 
(Fig. 4c and Extended Data Fig. 9). This facilitates interaction between 
the proteins, as Arg132 of DLX3 inserts into the minor groove, posi- 
tioning the conserved amino acids Tyr135, Ser136 and Ser137 in such 
a way that the peptide backbone makes three hydrogen bond contacts 
with Asp336 of MEIS1 (Fig. 4d). 

In summary, our sampling of a large number of TF-TF interactions 
revealed a much greater number of interactions than previously 
reported. Many novel DNA motifs were enriched in ChIP-seq TF 
clusters and conserved in mammalian genomes. Based on the fraction 
of pairs tested, we estimate that ~25,000 distinct TF pair specificities 
contribute to protein-DNA interactions in cells (Supplementary Table 
4). The frequent interactions between TFs, together with nucleosome- 
mediated cooperativity”? are consistent with the observation that 
TF binding in cells occurs in dense clusters. The clusters are also likely 
to be stabilized by TF co-factors, and complexes such as Mediator, 
cohesin and RNA polymerase II!?*?°. Such higher-order complexes, 
and complexes between TFs formed by domains that were not included 
in our constructs (Supplementary Table 1) are likely to further 
contribute to the cooperativity of TF binding in vivo. 

Most of the observed interactions involve close packing of the 
individual TF’s core motifs, and overlap between the motif flanks. 
The sequence between the core motifs commonly differs from that 
expected from the individual motifs. The composite sites would often 
be recognized by the individual TFs, but with relatively low affinity. Our 
findings are thus consistent with the general low affinity of sites bound 
in vivo in ChIP-seq experiments, and the fact that the conservation pat- 
tern of many regulatory elements extends beyond known TF binding 
sites. Taken together, our results show that cooperativity is an inherent 
feature of TF-DNA binding, and that DNA itself functions as an active 
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interacting partner, commonly facilitating the interactions between a 
wide range of TFs from diverse structural families. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Sequencing and data analysis. Unselected initial libraries and products of the third 
selection cycle were purified using a PCR-purification kit (Qiagen) and sequenced 
using [lumina HiSeq 2000 (multiplexed as in ref. 31; 55 bp single-read length). 
Raw sequence reads were demultiplexed, and initial quality control was performed 
using IniMotif*!. Sequencing depth was set in such a way that on average each 
experiment would result in 250,000 sequence reads. Based on previous enrichment 
ratios, this should lead to more than 1,000 highly enriched seed subsequences to be 
detected (count statistics; Poisson distribution, 3.16% standard deviation; expected 
background for 10bp seed 15 counts, P value 1.26 x 10°?” using winflat, expected 
15, observed >1,000; Bonferroni corrected P value < 1.32 x 10-767), Average 
background-corrected count for the seed match at the indicated multinomial 
setting was 3,295 (Supplementary Table 2). All sequence data has been deposited to 
ENA (European Nucleotide Archive) under accession number PRJEB7934. 

To identify overlapping composite sites, we used the AUTOSEED tool described 
in Nitta et al.'°. This tool is based on identification of gapped and ungapped sub- 
sequences that represent a local maxima within a Huddinge distance’ of 1; that 
is, they are more enriched than all subsequences that align with them with k—1 
perfectly matching bases, where k is the length of the ungapped subsequences. 
Cell culture and ChIP-exo. LoVo (source: ATCC, cat. no. CCL229TM), 293FT 
(source: Thermo Fisher Scientific, cat no. R700-07) and GP5d (source: ECACC, 
cat. no. 95090715) cells were cultured in DMEM supplemented with 10% fetal 
bovine serum (FBS) and antibiotics. Cells were obtained directly from the indi- 
cated source, and tested and found negative for mycoplasma contamination by 
immunofluorescence analysis after staining with 3.3 4g ml! bisBenzimide H 
33342 trihydrochloride (Sigma cat no. B2261). All antibodies used in ChIP-exo 
experiments were ChIP-grade. In each experiment a non-specific IgG was used 
as a control. ChIP-exo was performed essentially as described in Rhee and Pugh!® 
with modifications from ref. 32 using antibodies for HOXB13, MEIS1 and rabbit 
IgGs (Santa Cruz Biotechnology and Abcam cat. numbers sc-66923X, ab19867, and 
sc-2027, respectively). See Supplementary Table 1 for the sequences of the Illumina 
sequencing adapters. Sequence reads were mapped to the human reference genome 
(hg18), using bwa with default parameters. For peak-calling, we used GEM* with 
default parameters, and the genome size set to 2,700,000,000. 

Construct design. TFs were selected to cover different structural classes and indi- 
vidual binding specificities. Thus, in the set, small TF families such as TEA and 
GCM are relatively overrepresented, and large families such as CH) zinc fingers 
and canonical homeodomains are underrepresented. The expression constructs 
contained the DNA-binding domain, and known dimerization and protein-protein 
interaction domains for TF families where such domains are known to be required 
for DNA binding. These included, for example leucine zipper domains of bZIP 
and bHLH proteins, pointed-domains of ETS factors, dimerization domains of 
nuclear receptors as well as short motifs such as ‘YPWM’ of the anterior homeo- 
domains that are known to be involved in protein-protein interactions* (see 
Supplementary Table 1 for full sequences and removed and retained domains). 
Flanking sequences of 15 amino acids were also included on both sides to allow 
folding of the known protein domains, and to retain amino acids that are located 
close to DNA and could mediate interactions between closely packed TFs. We 
have previously shown that such constructs recover accurate binding specificities 
and homodimeric interactions by analysis of 125 pairs of such constructs and the 
corresponding full-length TF proteins*!*. 

Protein expression, purification and activity testing. Bacterial protein expression 
Gateway recipient vectors that incorporated a N-terminal Thioredoxin-6 x His tag, 
with either a C-terminal streptavidin binding peptide (SBP) or 3 x Flag tag were 
constructed using pETG20A-plasmid as a backbone. Inserts for protein expres- 
sion were derived either by gene synthesis (Genscript; see Supplementary Table 
1 for protein sequences and domains), or from previously published Gateway 
donor clones’. All proteins were expressed in the Rosetta 2(DE3)pLyss E. coli 
strain as fusion proteins using the auto-induction protocol described in Jolma 
et al.®, Proteins were purified using nickel affinity purification (GE Nickel sepharose 
Fast-Flow6, GE, Sweden) and stored at —20°C in 50% glycerol, 150 mM NaCl, 
250 mM imidazole, 15 mM Tris-Cl, pH 7.5. 

Protein expression and purification from E. coli cells was performed as described 
in Jolma et al.®. Briefly, the expression system used Rosetta 2(DE3)pLysS strain of 
E. coli (Millipore) cultured in ZYP5052 autoinduction media, where the expression 
of proteins is induced upon consumption of the preferred carbon source (glucose). 
Transformed cells were first cultured in deep well 96-well plate (Thermo, AB0661) 
wells in TB-medium at 37°C for overnight and then transferred to the auto- 
induction medium (1:40 dilution, see Vincentelli et al.>°). When the cell density 
was between 2.0 and 3.0 optical density at 600 nm, the temperature was lowered to 
17°C, and the culture continued for 40h. The cells were harvested by centrifuga- 
tion (4,000 rpm for 15 min), and lysed by incubation with buffer A (300 mM NaCl 
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in 50 mM Tris-Cl, pH 7.5) containing 10mM imidazole, 0.5 mg ml! lysozyme 
(Sigma) and 1 mM PMSF (Sigma). The lysis was completed by a freeze-thaw cycle. 
DNase I and MgSO, were added to the thawed lysate at 10 ug ml! and 1 mM final 
concentration, respectively, and the lysates incubated with Ni-Sepharose 6 Fast 
Flow resin (GE Healthcare) and shaken for 45 min. The lysate was then transferred 
to a filter plate (Nunc, 278011, 20 um pore size), and the beads washed two times 
each with 600 ul of 10mM and 50mM imidazole in buffer A using a vacuum 
manifold. The bound proteins were eluted from resin using 500 mM imidazole in 
buffer A. The expression of the purified proteins were checked by UV absorbance 
at 280 nm and SDS-PAGE electrophoresis (E-PAGE protein gels, Invitrogen) and 
Coomassie brilliant blue staining. 50% glycerol was added to the proteins before 
storage at — 20°C. 

In most cases the activity of the proteins was assessed by HT-SELEX*"", and pro- 
teins that robustly enriched expected sequences were included in the CAP-SELEX 
process. As some TFs are only expected to bind to DNA as a heterodimer, we also 
included in CAP-SELEX some proteins that did not robustly enrich sequences in 
HT-SELEX. These included the known obligate heterodimers MYC, PBX1, PBX2 
and PBX4. All included proteins are indicated in Supplementary Table 1. The 
HT-SELEX analysis yielded expected binding sites for most individual TFs, and 
in addition resulted in identification of novel motifs for TFs (see Supplementary 
Table 2). HT-SELEX was also used to validate some of the CAP-SELEX results 
with full length TFs. In these cases the pair of proteins were mixed together in an 
~1:1 ratio before the further steps of the HT-SELEX (see Supplementary Table 1 
for the details of the clones). 

In most cases, a scaled-up culture (50 to 100 ml) was used to express more of 

the TF constructs for CAP-SELEX. The protocol used was similar to that used 
for the deep-well plate cultures, except that proportionally larger lysis and wash 
volumes were used, and that 1 ml Ni-Sepharose 6 Fast Flow gravity columns 
(GE Healthcare) were used as the affinity matrix. 
CAP-SELEX assay. Previous systematic efforts that have focused on heterodimer- 
ization between TFs have generally studied proteins that dimerize in the absence 
of DNA*°-38, However, some cases of TFs that cannot dimerize in the absence of 
DNA, or only interact with each other indirectly through DNA have been described 
in the literature**"°, The CAP-SELEX process can capture both types of inter- 
actions, and is based on a combination of HT-SELEX®*! with tandem-affinity 
purification’. In this assay, a pair of TFs tagged with different affinity tags, SBP 
and 3 x Flag, and a double stranded DNA ligand that contains a 40-bp randomized 
region are mixed together in individual wells of a 96-well plate in a buffer that 
mimics biological conditions in the nucleus”, and the mixture is incubated for 
30 min, after which the bound dsDNA ligands are separated from free ligands 
through consecutive affinity purification by first the SBP and then the 3 x Flag 
tagged protein using affinity beads and an automated plate washer (Fig. 1a). Bound 
DNA is then amplified by PCR and sequenced, and the selection process repeated 
three times. Binding of the TFs is then revealed by enrichment of characteristic 
subsequences (see below). 

The 40-bp random window corresponds to almost four complete turns of the 
DNA helix, allowing detection of interactions between two TFs that exclusively 
occupy 9 bp of sequence (see ref. 20) over two full helical turns. As with our previ- 
ous HT-SELEX platform, the purified ligands are barcoded and directly compatible 
with multiplexed Illumina sequencing (for selection ligand sequences, please see 
Supplementary Table 1). 

Of the proteins tested here, most (87%) were functionally validated by 
HT-SELEX (see Supplementary Table 1 for details). The low activity of some 
HT-SELEX validated proteins (Supplementary Table 1) was probably due to the 
fact that CAP-SELEX involves two consecutive affinity purifications, and is there- 
fore more stringent than HT-SELEX. 

For CAP-SELEX, 10-200 ng (see Supplementary Table 1) purified Flag- and 
SBP-tagged proteins were diluted into 25 ul volume of binding buffer (140 mM KCl, 
5mM NaCl, 2mM MgSOug, 3 uM ZnSOg, 100 uM EGTA, 1 mM K3;HPO,, in20mM 
HEPES, pH 7.0) containing approximately 10 umol DNA selection ligands, and 
incubated for 20 min at room temperature. Subsequently, 0.2% BSA, 0.1% Tween 20 
pre-blocked Streptavidin-coated magnetic Sepharose beads (1.25 ul; GE Healthcare 
Streptavidin Mag Sepharose) in two volumes of binding buffer were added, and the 
mixture incubated at room temperature for 2h with vigorous shaking (800 r.p.m.; 
Edmund Biihler TiMix shaker). The beads were subsequently washed 5 times with 
binding buffer, using BioTek 405 CW plate washer with a magnetic platform. The 
protein-DNA complexes were eluted from the beads using 50 ul of 10 mM biotin 
(Sigma) in binding buffer. The eluate was transferred to a fresh plate containing 
M2 anti-Flag magnetic beads (1.25 ul; Sigma) in 50 ul of binding buffer, and shaken 
at 800 r.p.m. for 20 min at room temperature. The beads were washed ten times 
with binding buffer, suspended in 0.1% Tween 20, 0.5mM EDTA, 10 mM Tris-Cl, 
pH 8.0, and transferred to a PCR plate. DNA was then eluted from the beads by 
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incubation at 95°C for 10 min. A 9 ul aliquot of the bead suspension was trans- 
ferred to a new PCR plate, and the DNA amplified by PCR (65°C for 10s, 72°C 
for 36s, 97°C for 15s for annealing, elongation and denaturation, respectively, for 
25 cycles). A separate aliquot was analysed by qPCR (Roche LightCycler 480) 
to monitor progress of the experiment. Amplified selection ligands were then 
subjected to sequencing and new cycles of CAP-SELEX (up to 3 cycles total). 

The input libraries and libraries selected for three cycles were then sequenced 
and analysed (see below). Cooperative complexes were initially detected from 
5-12-bp long primary and secondary binding models generated by the previously 
described SELEX data-analysis tool IniMotif*". Initial results validated the method 
through confirmation, characterization and refinement of the binding specificity 
models for 12 previously known heterodimers (Fig. 1a and Extended Data Fig. 1). 
Generation of PWMs. To detect heterodimeric sites that can occur in many dif- 
ferent orientations and spacings, we used the de novo motif discovery algorithm 
Autoseed!®. Autoseed finds gapped and ungapped subsequences that represent 
local maxima, that is, are more enriched than closely related subsequences. Control 
experiments established that such preferences were not observed in the input 
libraries. 

Based on analysis of CAP-SELEX cycle 3, the sequencing was then extended to 
cover products from earlier selection cycles for samples that showed enrichment 
of motifs that were similar to the expected TF2 motif. We have previously shown 
that analysing SELEX data from early cycles allows recovery of low-affinity sites, 
and results in motifs that are very similar to those determined using competition 
assays®!6, 

For identification of co-operative sites where the individual TF motifs were 
spaced farther apart, we defined representative 6-mer sequences for each TE. For 
each experiment, we then identified cases where both representative 6-mers were 
found in the same sequence reads, and from these reads, counted the occurrence of 
each spacing and orientation combination. The expected distribution of spacings 
without any preferences is non-uniform due to the limited size of the randomized 
region but cannot explain the local maxima observed in the data (spacings that 
are preferred compared to both shorter and longer spacings). Given the size of 
both occupied sites m and a fixed spacing n between them, the number of ways 
the sites can be placed into the randomized region of size lis c, =1— 2m-n+1 
(0<n<1— 2m), or twice that if the order of sites is taken into account. As the 
expected count of spacing n is directly proportional to c,, the count is a linearly 
decreasing function of n and thus has no maxima except the one at the boundary 
n=0. Moreover, the expected relative difference of counts between consecutive 
spacings is (c,—- 1 — Cy)/Cy = 1/cy indicating that the expected differences are small 
in the range where spacing preferences are observed. 

To display the preferred spacings and orientations of the two TFs, subsequences 
containing both consensus 6-mer sequences of the motifs with variable-length gap 
between them were counted, and the counts represented as a heatmap (for example, 
Fig. 1). For each orientation of the 6-mers, we identified the spacing with maxi- 
mum counts. That spacing was considered to be preferred if the sum of the counts 
of it and its two neighbours was higher than 30% of the total count for all spacings, 
and less than 20% of the reads counted were derived from a single non-unique 
read. If one preferred case was identified for a pair of 6-mers, we then determined 
whether other spacings and orientations were also enriched. Up to five spacings 
and orientations were considered to be preferred if their respective counts were 
higher than 50% of the maximum count after mean normalization of all counts. 
Cases where both TF1 and TF2 6-mers were detected robustly, but no preferred 
orientation and spacing was detected were classified as ‘weak or no co-operativity. 
In case of experiments performed several times, interactions were called if they 
passed the thresholds in at least one case. In each case, control unselected ligands 
were also sequenced, to ensure that the oligonucleotide synthesis resulted in even 
distribution of mononucleotides. In addition, several ligands were sequenced very 
deeply (>10 million reads) to ensure that 6-mer subsequences were distributed 
at a similar frequency along the 40-bp random sequence window (Extended Data 
Fig. 4). For assessment of reproducibility, see Extended Data Fig. 3. 

Subsequently, the enriched subsequences were used as seeds to generate position 
weight matrix (PWM) models for the complexes. We generated PWM models as 
described in Jolma et al.3*!, using the seeds, selection cycles and multinomial set- 
ting indicated in Supplementary Table 2. The models were subsequently inspected 
to remove cases that could be also explained by homodimeric binding (for example, 
cases where two TFs with similar primary motifs were analysed). After identi- 
fication of an enriched sequence, seed refinement was performed essentially as 
described in Jolma et al.®. In the majority of the cases, the PWM models were 
generated using selection cycle 3. The models were expert curated to separate 
different binding modes, and to remove cases where there was excessive posi- 
tional bias or where the two proteins bound to very similar sites and thus we could 
not differentiate between homo- and heterodimeric binding. All seed, cycle and 


multinomial settings used are indicated on Supplementary Table 2, and the 
sequence reads have been deposited to ENA under accession (PRJEB7934). 

Hydrogen bond contacts between TF amino-acids and DNA bases were identi- 

fied using the program CONTACT that is included in the CCP4 software suite®. 
Hydrogen bond information was added to PWMs to generate contact annotated 
PWMs (pfmc). PWMs were aligned to each other by minimizing the sum of indi- 
vidual base-to-base comparison scores calculated as follows: Max (information 
content for PWM1 position n, information content for PWM2 position m) * 
(Manhattan distance between base frequencies of PWM1 position n and PWM2 
position m). In regions where there was no overlap, the positions were compared 
to an equal frequency of all bases. Pairs of positions whose score was smaller than 
0.25 are indicated by boxes in Supplementary Data Set 2. The same cut-off was 
used to count divergent base positions in Extended Data Fig. 6. 
Enrichment of motifs at ChIP-seq TF clusters and prediction of ChIP-seq peaks. 
Interactions between TFs appear to be important in vivo, as recent large scale 
genome-wide location analyses of TFs in cultured cells have revealed that in a 
given cell type, TFs bind only to a subset of their potential target sites, and that the 
occupied target sites are located in high-density clusters, where many different 
TFs colocalize within a few hundred bp long regions!**°, Many of the occupied 
sites do not contain high-affinity sites for the analysed TF, suggesting that co- 
operative interactions allow binding of TFs to low-affinity sites!?"°, Formation 
of TF clusters is probably at least in part the result of competition between TFs 
and nucleosomes, which indirectly results in increased occupancy of TFs close 
to each other, even when the TFs do not have direct cooperative interactions’. 
Another mechanism that could contribute to TF cluster formation is direct co- 
operativity between TFs. To test this, we analysed enrichment of monomer and 
heterodimer PWM matches at human LoVo colon cancer TF clusters’? (Fig. 3a 
inset, Supplementary Tables 2 and 4) as described in Yan et al.!”, using a score 
cut-off for each motif that resulted in one match per 10 kb of genomic sequence. 
P values for the enrichment were calculated using winflat. Similarity of TF DBD 
amino-acid sequences was determined using PRANK”. Similarity of PWMs was 
determined using SSTAT“S using the default parameters. Motif dendrogram was 
drawn by using Euclidean distance metric with average linkage with R package ape. 
Network shown in Fig. 3 was drawn based on the same distances. The dominating 
set of the models was generated essentially as described in Jolma et al.°. Briefly, 
we first generated a network where monomeric or homodimeric motifs from Jolma 
et al.8, and heterodimer motifs from this study were connected to each other if 
they were similar. To determine how many novel specificities were identified in 
our study, we used the minimum dominating set of this network to identify a set 
of motifs that represents distinctly different specificities. 

Sequence and barcode-logos were generated as described in Nitta et al.'°. In 
barcode logos shown in Fig. 3 and Extended Data Figs 3 and 7, four bars are drawn 
that represent the frequency of the bases for each base position. Width of each bar 
is proportional to the frequency of the corresponding base (range 0 to 1), and both 
height and colour intensity of all the bars at a given position are proportional to the 
frequency of the most common base at that position (range 0.25 to 1). In the DNA 
base letter PWM logos shown, the height of each letter is directly proportional to 
the frequency of the indicated base at the indicated position. 

Analysis of error rates in prediction of ChIP-seq/exo peaks using the dimeric 
PWMs was performed using a random forest classifier. For the random forest classi- 
fier, the R package randomForest, version 4.6-6, was used (http://cran.r-project.org/ 
web/packages/randomForest/index.html). The classifier was trained on TF motif 
matches to discern peak summit regions from close by non-peak summit genomic 
positions. Stated accuracy estimates are based on out-of-bag error estimates. 

ChIP-seq binding site prediction error rate analysis was performed on data 
gathered from existing ChIP-seq experiments and from the HOXB13 ChIP-exo 
experiment from this study. The FASTA sequences of 1,001 bp genomic regions 
surrounding ChIP-exo peak summits for HOXB13 (from this work) and previously 
described ChIP-seq peak summits for ELF1 (ref. 21), ELK1 (ref. 49), HOXB13 
(ref. 50) and two different MEIS1 experiments*>°? were used as a positive set 
together with a negative set consisting of 1,001 bp FASTA sequences taken from 
2,000 bp away from the peak summits. For each FASTA sequence in the collection, 
the score and relative position of the highest-scoring match to each motif in the 
PWM collection was recorded and used to train a randomForest classifier with 
5,000 trees. To determine whether the dimer partners had predictive power, a 
classifier trained using the monomer motif of the relevant TE, and all its dimer 
motifs was compared to a classifier trained using the monomer motif and dimer 
motifs with the partner region of the motif reversed. Error rates were estimated 
using out-of-bag predictions. 

Analysis of conservation of motif matches. To measure the conservation of 
genomic sites recognized by heterodimeric motifs we developed a procedure to test 
whether the heterodimeric motifs explained patterns of evolutionary conservation 
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observed in the human regulatory elements. To identify the potential conservation 
attributable to the specific TF-TF-DNA interactions described by the motifs, the 
genomic sites recognized by each heterodimeric motif were compared to sites 
recognized by artificial control motifs that represented different orientations of 
the two TFs but were otherwise comparable to the original motif, for example had 
the same width and information content. To obtain control motif sets containing 
the specificities of the individual TFs as embedded in the heterodimeric motifs, 
each heterodimeric motif was split into two partial motifs at all possible cut points 
with the restriction that both the ‘left’ and the ‘right’ sections were at least one- 
third of the width of the whole motif (rounded down). The control motifs were 
constructed by concatenating each of the partial motif pairs in all three alternative 
orientations (‘right’ followed by ‘left; ‘left’ followed by the reverse complement of 
‘right, and the reverse complement of ‘left’ followed by ‘right’). For example, a motif 
of length fifteen was split after 5-10 bases resulting in 18 control motifs. 

Sites recognized by a heterodimeric motif and its control motifs were searched 
from the human genome constrained elements® (SiPhy pi 12-mers with 10% false 
discovery rate in reference genome hg19, regions shorter than 50 bp or overlap- 
ping exons or repeats according to Ensembl version 70 were removed resulting 
in total 41 Mb of sequence) using the program MOODS™ with a loose cut-off (P 
value <10~? with flat background distribution) to obtain a large excess of putative 
binding sites for each motif. All found sites were merged into one list and 10,000 
non-overlapping highest affinity sites selected for conservation analysis regardless 
of the motif identity (heterodimer or control). 

Whether the evolutionary conservation of the high affinity sites was explained by 
the motifs was tested using program SiPhy*® (version 0.5, task 16, seedMinScore 0) 
and multiz100way multiple alignments” of 99 vertebrate species to human (down- 
loaded from UCSC genome browser, version hg19). A site was marked as being 
conserved according to the motif if its SiPhy score was positive meaning that the 
aligned bases at the site were better explained by the motif than by a neutral evo- 
lutionary model (hg19.100way.phastCons.mod obtained from UCSC genome 
browser). 

The hypothesis that the heterodimeric motif sites were more likely to be 
conserved according to the motif than the sites of its control motifs was tested 
against the null hypothesis that there was no association between site conserva- 
tion and motif identity using Fisher’s exact test (one-sided). The P values given 
by the tests for individual heterodimeric motifs were corrected for multiple 
testing using Holm’s method. This procedure detected genomic conservation 
for 149 out of 618 motifs (24%) at family-wise error rate <0.05 (including the 
previously known ETV2-FOXI1 motif” used as a positive control while devel- 
oping the procedure). 

Protein purification, crystallization and data collection. The DNA-binding 
domains of human MEIS1 (residues 277-339) and DLX3 (residues 122-193) were 
overexpressed as a thioredoxin-6His fusion protein in E. coli and isolated from the 
soluble cell lysate by affinity chromatography followed by gel-filtration chromato- 
graphy as described in ref. 57. The DNA fragments used in crystallization were 
obtained from MWG as single strand oligonucleotides and annealed in 150 mM 
NaCl, 1 mM EDTA in 10mM Tris-Cl, pH 7.5. The purified proteins were first 
mixed with solutions of annealed DNAs at a molar ratio of 1:1.2 and after incu- 
bation for 15-20 min on ice subjected to the crystallization trials. Crystallization 
experiments were carried out with an in-house developed crystal screening kit of 
different polyethylene glycols. The crystals of MEIS1-MEIS1-DNA complex were 
obtained in sitting drops at room temperature from 100 mM Tris-Cl (pH 8) solu- 
tion containing 25.6% (w/v) PME (5000), 80 mM MgCl and 10% PEG (400). The 
crystals of monodimer MEIS1-DNA complex were obtained from 100 mM HEPES 
(pH 7.09) solution containing 30% (w/v) PME(5000), 80 mM MgCl and 10% 
PEG (400). Crystals of MEIS1-DLX3-DNA complex were obtained from 100 mM 
Tris-Cl (pH 7.5) containing 24% PEG (8000), 40 mM MgCl and 5% butanol. All 
diffraction data for both complexes were collected at beam-line ID23-1 at the ESRF 
(Grenoble, France) using the reservoir solution as cryo-protectant. The data col- 
lection strategy was optimized with the program BEST”. The data were integrated 
with the program XDS* and scaled with SCALA™. Statistics of data collection are 
presented in Supplementary Table 5. 

Structure determination and refinement. The structures of all three complexes 
were determined by molecular replacement using the program Phaser® in Phenix™ 
with the structure of MEIS2 and DLX5 (PDB entries 3K2A and 2DJN, respectively) 
as a search model. The manual rebuilding of the model was done using COOT™ 
combined with refinement with Phenix.refine using TLS option. The refinement 
statistics are presented in Supplementary Table 5. The atomic coordinates and 
diffraction data have been deposited to Protein Data Bank with the accession 
codes 4XRM, 5BNG and 4XRS, for MEIS1 homodimer, MEIS1 monomer and 
MEIS1-DLX3 heterodimer, respectively. 
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Data reporting. No statistical methods were used to predetermine sample size. The 
experiments were not randomized. The investigators were not blinded to allocation 
during experiments and outcome assessment. 
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Extended Data Figure 1 | CAP-SELEX data analysis and comparison to 
previous data. a, Flowchart of CAP-SELEX data analysis. Left, a library 

of selection ligands with random sequences (yellow) is incubated with 

TFs. After CAP-SELEX, enriched individual TF motifs (1°; arrows) and 
composite motifs that are not simply combinations of the individual motifs 
(2°; green dots) are detected from the reads. To detect preferential spacings 
and orientations of the TF pair (3°), co-occurrence of the indicative 6-mer 
subsequences (arrowheads) are counted from the reads. The subsequences 
are then used to generate seeds for the PWM models (right). Heatmap 
(bottom right; scale divided by highest observed count) shows frequency 
of occurrence of the two 6-mers (CCGGAA, red arrowhead; CATTCC, 
black arrowhead) in all possible spacings (columns) and orientations 
(rows). Note that the 6-mer based approach cannot model the composite 


Known consensus Method — Ret. 
CTGACGCAAT EMSA (26) 
CTGACGCAAT EMSA (66) 
GTCGTAAAACTGTCA EMSA (27) 
CATTAGCATGACAAAGACA Se ne (28) 
EMSA (64) 

Method 
Mammalian two hybrid (37) 
Mammalian two hybrid (37) 
Mammalian two hybrid (37) 
Mammalian two hybrid (37) 
Protein microarray (38) 
co-immunoprecipitation (65) 
Yeast two hybrid, (67) 
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site, but identifies a strong case of cooperativity where the ERG 6-mer 
CCGGAA is followed by the TEAD4 6-mer CATTCC site with an 8 bp gap. 
Logo of the PWM for this site is also shown. b, Comparison between 
CAP-SELEX PWMs and previously characterized specificities for 

the indicated TF pairs. This method has been used previously and its 
references are also indicated. CAP-SELEX models also shown in Fig. 1 
are indicated by asterisks. Note that four out of five of the CAP-SELEX 
models are similar to the previously identified consensus sequences. The 
exception is ELK1-PAX5 consensus, that matches poorly both the CAP- 
SELEX motif and individual motifs for ELK1 and PAX5 (not shown). 

c, CAP-SELEX PWMs for TF pairs known to interact at protein level. 
Method used to identify the protein-protein interaction and its reference 
are also shown®26-28:37:38,64-67, 
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Extended Data Figure 2 | Pairwise interaction matrix between TFs. Fig. 1). Histograms show the counts for the interactions for each TF. 
Columns indicate TF1 proteins, and rows TF2 proteins, subjected to the Only TFs for which at least one clear interaction or independent binding 
first and second affinity purifications, respectively. Pairs of TFs with a was identified are included. The importance of including DNA in the 
single spacing and orientation preference are indicated in dark green, interaction assay is highlighted by the fact that only four and five of the 
and pairs with multiple preferred configurations in light green. White interactions detected are among those observed between 762 human or 
boxes indicate pairs that displayed weak or no interaction, and grey boxes 877 mouse TF pairs identified using protein-protein interaction assays””, 
cases where robust preference data was not recovered. Previously known or compiled from literature*, respectively. 


interacting TF-pairs are indicated by a yellow outline (see Extended Data 
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Extended Data Figure 3 | CAP-SELEX reproducibility. a, Replicate and barcode logos of all PWM pairs. Plot in the middle shows fraction of 
analysis of more than two hundred of the generated PWMs. The same seeds reads included in the same models in replicate 1 and 2. b, Validation of 
that had been used to generate PWMs for the primary experiments were the CAP-SELEX analysis using shortened TF constructs (DBD+) by HT- 
used to seed new PWMs from the replicates. Left, red bars on the left show SELEX using full-length protein mixtures (full length). Note that the same 
the percentage of the PWM pairs that are similar at the indicated cut-offs orientation and spacing is preferred in all but one of the cases. In one case 


(measured as SSTAT covariance*“*). The highest threshold is the same used (bottom), full-length proteins show the highest preference to a different 
for identifying the dominating set of PWMs. Blue bars indicate fraction ofall spacing than that observed in CAP-SELEX; even in this case, the second- 
replicate PWMs that are similar using the same cut-off. Right, dendrogram most preferred spacing is the one identified using CAP-SELEX. 
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Extended Data Figure 4 | Long-range cooperativity. Many experiments 
where TFs bound sites that were relatively far apart showed preferential 
binding to sites that are separated by approximately nine to ten bases. 
Heatmap (maximum count set to 1) representations showing frequency 
of occurrence of the representative 6-mers for TF pairs in all possible 
spacings (columns) and orientations (rows). a, Replicate experiment 

of GCM1 (black arrowhead) and MAX (red ball) pair show very 

similar preference for cooperatively bound representative 6-mers (see 
Supplementary Table 1). While one of the orientations shows preference 
for a single spacing, the second has two preferentially recognized regions 
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separated by ~9 bp. b, TEAD4—CEBPB pair shows a similar ~9 bp 
separation between three regions of preferred spacings (brackets). c, Very 
deep sequencing of the unselected input ligand does not show the same 
preference, instead counts decrease linearly as a function of gap length 
(due to decreasing number of available positions in the 40N random 
sequence). The mode of cooperativity seen in a and b appears similar 

to that reported by Kim eft al.'’. In addition to high-affinity sites, lower 
affinity spacings and orientations between TF pairs could be employed in 
fine-tuned transcriptional responses (see refs. 68, 69). 


© 2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


a Prevalence of interaction types based on structural analysis 


DNA mediated 


DNA facilitated 


N st 


Potentially 
protein mediated 


b Evolutionary conservation of the dimeric PWMs 


Re eCTCF 
+ - 
e = 
oO ° ba Dio 
oN ae ° e HOXB2:ETV4 = 
= ee eTEAD4:ERG = nee 
G i) - 
= 08 EEO etve:FOx : i 
o gst FAR “ £, ° PBX4HGXAI0 Bog 
3 Ae oRIx2 
— eGo e 0 8: 2 saps NEFA 
ir ra F 
= Mi esol oc 
° ° 
0 400 800 1200 0 400 800 1200 1600 2000 


Number of conserved motif sites 
among tested sites 


Prediction of ChIP-seq peaks 
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Extended Data Figure 5 | CAP-SELEX motifs are conserved 

and enhance prediction of in vivo peaks. a, Pie chart showing 

the frequency of DNA-mediated, DNA-facilitated and potentially 
protein-protein interaction mediated heterodimers in the CAP-SELEX 
data set. Cooperativity between TFs can result from direct contacts 
between the proteins (protein-mediated), DNA-facilitated protein 
contacts (DNA-facilitated) or arise indirectly from DNA-mediated 
interactions'7***9407.71, The last type of cooperativity is caused by the 
DBD binding-induced changes in DNA shape, and do not involve other 
domains or direct contact between the proteins'”***". The dimers were 
classified to DNA-mediated, DNA-facilitated and potentially protein- 
protein interaction mediated classes manually, based on structural models 
shown in Supplementary Data Set 2 . b, Conservation of the genomic 
sites recognized by the CAP-SELEX identified heterodimeric motifs (left) 
compared to monomeric and homodimeric sites identified by HT-SELEX 
(right, motifs from ref. 8). For each motif, ten thousand non-overlapping 
highest affinity sites within human constrained non-coding regions 
recognized by the motif or one its control motifs (see Methods for details) 
were selected and their conservation was tested. The fold enrichment 

(y axis), that is, the fraction of conserved sites among the motif sites 


(Penkov) 
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HOXB13 ChIP-Exo, original vs. reverse partner 


Doriginal 
7G reverse partner 


0.5 


MEIS1 


0 200 
relative genomic position 
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divided by the fraction of conserved sites among the control motif sites, 

is shown as a function of the number of conserved motif sites among the 
top ten thousand sites (x axis). The motifs that are significantly conserved 
(multiple testing adjusted P value <0.05) are marked green. Five motifs 
with lowest P values are also indicated. Note that ~50% of the HT-SELEX 
and ~25% of the CAP-SELEX motifs are conserved above the significance 
threshold. c, Inclusion of heterodimeric motifs improves prediction 

of ChIP-seq peaks. Left, the error rate of prediction of ChIP-seq peak 
positions using either the monomer motifs and CAP-SELEX dimers (light 
grey), or monomer motifs and control motifs where the partner of the 
indicated TF is reversed but not complemented (dark grey) are shown. 
Note that inclusion of the correct heterodimeric motifs decreases the 
prediction error rate in the cases of HOXB13 and MEIS1. The relatively 
modest effect is likely due to the fact that only a subset of heterodimers 
were identified in our study, and that ChIP-seq peak positions are also 
influenced by other factors such as nucleosome binding and chromatin 
structure. Right, number of PWM matches as a function of distance from 
HOXB13 ChIP-exo peaks. Note that using the original heterodimer motifs 
clearly outperforms the control motifs. 
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a Summary of positions altered by heterodimer formation 
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Extended Data Figure 6 | Heterodimers where the individual TF core 
recognition sites appear to overlap. a, Composite site formation alters 
specificity at bases flanking the core TF site. TFs often directly read 
specific ‘core’ sequence motifs via hydrogen bonding to DNA bases. The 
sequences flanking this core are commonly read indirectly, through 
contacts to the sugar and phosphate backbone of DNA”””4. The backbone 
contacts specify a preferred DNA conformation, which then leads to a 
preference of a sequence that is optimal for stacking interactions between 
consecutive base pairs (reviewed in ref. 74). Figure shows summary of 
base positions whose specificity is affected in all composite sites identified 
in this study for the indicated TFs. Note that the bases comprising the 
core motif that is recognized by direct hydrogen bonds to the DNA 

bases are not commonly affected by heterodimer formation. In contrast, 
specificity at flanking positions that are recognized by contacts to the 


C000 300 


sugar or phosphate backbone of DNA are commonly altered by binding 

of the heterodimer partner. Hydrogen bonds contacts were determined 
based on the indicated (refs 29 and 30) or homologous TF structures (see 
Supplementary Table 3). b, A base (arrow) can be contacted both from the 
side of the major groove (black dot; G contacted by GCM1) and the minor 
groove (white dot; C contacted by DRGX homeodomain). c, A TF that can 
bind to a homodimeric site appears instead to bind as a heterodimer. 

A composite site is shown where HOXB2 appears to form a heterodimer 
with a monomer of REX5. d, In some cases, the binding positions of the 
TFs cannot be unambigiously assigned based on the composite recognition 
sequence. In a, the annotation of hydrogen bond contacts is as described in 
main Fig. 2; in b-d, the major groove contacts of the left and right TFs are 
indicated in black and red dots, respectively. 
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Representative PWMs of heterodimeric complexes and individual TFs 
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Extended Data Figure 7 | Specificities of individual TFs and heterodimer pairs. Dendrogram shows motif similarities between the representative 
heterodimer and monomer motifs. Heterodimer models are indicated by green bars. Barcode logos for each factor are also shown. Centre of dendrogram 
shows the colour key for the TF families. 
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a Comparison of top computationally predicted models by Guturu et al., 2013 
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Extended Data Figure 8 | Comparison of CAP-SELEX models to models 
inferred from conserved genomic sequences. a, Motifs that are very 
similar to the CAP-SELEX motifs are enriched and conserved. A previous 
study by Guturu et al.?° made structural models for pairs of TFs to identify 
sterically possible configurations and predict sites that could be bound by 
such complexes. Enrichment of those target sites were then quantified in 
evolutionarily conserved noncoding regions over nonconserved control 
regions to infer putative target sites for cooperatively binding TFs. Pie 
chart shows comparison of top 100 most significant target sites predicted” 
to all heterodimeric PWMs generated in this study. 15 PWMs showed clear 
similarity to our heterodimeric PWMs (upper right, dark green slice), 
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8 were partially similar (green) and further 5 had enrichment for the 

site but under the threshold used in our study. We did not detect 

25 motifs, and for 14 potential pairs, we identified a different spacing and 
orientation. This result is expected as we did not test all potential TF-TF 
pairs, and many TFs that bind to similar monomer sites prefer different 
dimer spacings and orientations. Finally, of the 100 Guturu et al.”° top 
motifs, 33 were not analysed in our study (14 were homodimeric and 

no possible pair was tested for 19; for example, three of the pairs were 
predicted for pairs with a SMAD TE, and no SMAD TFs were tested in our 
study). b, Comparison of the 15 (top) and 8 (bottom, boxed) matching and 
partially matching PWMs, respectively. 
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Extended Data Figure 9 | Detailed view of MEIS1 and MEIS1-DLX3 structures. a, Contacts between MEIS1 (cyan) and DNA. b, Contacts between 
DLX3 (magenta) and DNA. c, Comparison of the DNA structures in MEIS1 homodimer (cyan) and MEIS1-DLX3 heterodimer (magenta). Note that the 
DNA bound to the heterodimer is more distorted. 


© 2015 Macmillan Publishers Limited. All rights reserved 


Mae ed fea 


doi:10.1038/nature15401 


Histone HI couples initiation and amplification of 
ubiquitin signalling after DNA damage 


Tina Thorslund', Anita Ripplinger'*, Saskia Hoffmann!*, Thomas Wild?*, Michael Uckelmann’, Bine Villumsen!, Takeo Narita’, 
Titia K. Sixma’, Chunaram Choudhary’, Simon Bekker-Jensen! & Niels Mailand! 


DNA double-strand breaks (DSBs) are highly cytotoxic DNA 
lesions that trigger non-proteolytic ubiquitylation of adjacent 
chromatin areas to generate binding sites for DNA repair factors. 
This depends on the sequential actions of the E3 ubiquitin ligases 
RNFS8 and RNF168 (refs 1-6), and UBC13 (also known as UBE2N), 
an E2 ubiquitin-conjugating enzyme that specifically generates 
K63-linked ubiquitin chains’. Whereas RNF168 is known to cata- 
lyse ubiquitylation of H2A-type histones, leading to the recruit- 
ment of repair factors such as 53BP1 (refs 8-10), the critical 
substrates of RNF8 and K63-linked ubiquitylation remain 
elusive. Here we elucidate how RNF8 and UBC13 promote recruit- 
ment of RNF168 and downstream factors to DSB sites in human 
cells. We establish that UBC13-dependent K63-linked ubiquityla- 
tion at DSB sites is predominantly mediated by RNF8 but not 
RNF168, and that H1-type linker histones, but not core histones, 
represent major chromatin-associated targets of this modification. 
The RNF168 module (UDM1) recognizing RNF8-generated 
ubiquitylations" is a high-affinity reader of K63-ubiquitylated 
H1, mechanistically explaining the essential roles of RNF8 and 
UBC13 in recruiting RNF168 to DSBs. Consistently, reduced 
expression or chromatin association of linker histones impair 
accumulation of K63-linked ubiquitin conjugates and repair fac- 
tors at DSB-flanking chromatin. These results identify histone H1 
as a key target of RNF8-UBC13 in DSB signalling and expand the 
concept of the histone code’””* by showing that posttranslational 
modifications of linker histones can serve as important marks for 
recognition by factors involved in genome stability maintenance, 
and possibly beyond. 

To explain mechanistically the critical role of UBC13 in the RNF8/ 
RNF168 pathway, we generated human UBC13 knockout cells using 
CRISPR-Cas9 technology’*”* (Fig. 1a). As expected, loss of UBC13 
abrogated the accumulation of RNF168, RNF168-dependent ubiquitin 
conjugates, and 53BP1 and other readers of these marks at DSB sites, 
while RNF8 recruitment was normal (Fig. 1b, c and Extended Data 
Fig. la, b). Reintroducing UBC13 into these cells restored 53BP1 focus 
formation, as did a fusion protein mimicking an RNF8-UBC13 E3-E2 
complex’*"’”, independently of endogenous RNF8 (Fig. 1d and 
Extended Data Fig. 1c, d). In contrast, expression of RNF8 alone or 
other RNF8-E2 chimaeras failed to support 53BP1 accumulation at 
DSBs in these cells (Fig. 1d and Extended Data Fig. 1c). Unlike RNF8, 
overexpression of RNF168 was sufficient to restore 53BP1 recruitment 
to damaged DNA in UBC13-knockout cells (Extended Data Fig. le). 
These data suggest that UBC13 mainly cooperates with RNF8 but not 
RNF168 to catalyse K63 ubiquitylation of DSB-flanking chromatin to 
promote recruitment of repair factors. 

To understand the molecular basis of this process, we used a tandem 
ubiquitin-binding entity (‘K63-Super-UIM’)"* that showed remark- 
able specificity for binding to K63 linkages but not to other ubiquitin 


chain types (Fig. le, f). Indeed, green fluorescent protein (GFP)-tagged 
K63-Super-UIM, but not a ubiquitin-binding-deficient mutant 
(Fig. le, f), was efficiently recruited to microlaser- and ionizing radi- 
ation (IR)-generated DSBs (Extended Data Fig. 2a-c), thus providing 
an efficient sensor of DSB-associated K63 ubiquitylation’. Depletion 
of UBC13 or RNF8, but not RNF168, abrogated recruitment of the 
K63-Super-UIM, as well as an independently derived high-affinity 
binder of K63-linked ubiquitin chains, to DSB sites, and auto-ubiqui- 
tylation-generated K63 ubiquitin chains were abundantly present on 
RNFS8 but not RNF168 (Fig. 1g and Extended Data Fig. 2b-f). This 
suggests that the roles of RNF8 and RNF168 in promoting DSB-assoc- 
iated chromatin ubiquitylation can be functionally uncoupled and that 
RNF8, but not RNF168, is a primary mediator of UBC13-dependent 
K63 ubiquitylation at DSB sites. 

To identify the RNF8-UBC13-dependent ubiquitylation processes 
underlying RNF168 recruitment to DSBs, we first characterized the 
impact of UBC13 loss on global K63-linked ubiquitylation under 
steady-state conditions. While K63-linked ubiquitylation is thought 
to be mainly catalysed by UBC13 (ref. 7), the extent to which this E2 
contributes to the total pool of K63 ubiquitin linkages in human cells is 
not known. To address this, we quantified the relative abundance of 
different ubiquitin chains in HCT116 wild-type and UBC13-knockout 
cells using stable isotope labelling by amino acids in cell culture 
(SILAC)-based mass spectrometry. Ablation of UBC13 decreased 
the global level of K63 ubiquitin linkages by ~50%, while the abund- 
ance of other chain types was not markedly affected (Fig. 2a). Using the 
di-glycine approach”®”’, we quantified over 3,000 ubiquitylation sites, 
of which fewer than 1% showed a >2-fold decrease in UBC13-knock- 
out cells (Extended Data Fig. 3a—d and Supplementary Table 1). Thus, 
UBC13 has little impact on the conjugation of the initial ubiquitin 
moiety to substrates, consistent with previous reports that UBC13 
primarily acts at the ubiquitin chain elongation step”. To identify 
UBC13-dependent K63-ubiquitylated proteins, we analysed K63- 
Super-UIM pull-downs from wild-type and UBC13-knockout cells 
under stringent buffer conditions by mass spectrometry. We identified 
371 proteins that showed >2-fold enrichment in wild-type cells, sev- 
eral of which are known targets of K63 ubiquitylation (Extended Data 
Figs 3e-h, 4a, b and Supplementary Table 2). 

We extended this proteomic strategy to search for the chromatin- 
bound substrate(s), whose K63 ubiquitylation by RNF8-UBC13 pro- 
motes recruitment of RNF168 and downstream factors to DSB sites 
(Extended Data Fig. 4c). Surprisingly, only few cellular proteins repro- 
ducibly displayed elevated K63-linked ubiquitylation upon IR-induced 
DSBs, and H1 linker histones, but not core histones, were major chro- 
matin-associated factors showing such behaviour (Extended Data 
Fig. 4d). Using the K63-Super-UIM we confirmed biochemically for 
two endogenous histone H1 isoforms (H1.2 and H1x) that they were 
modified with K63-linked ubiquitin chains and, more importantly, 
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Figure 1 | RNF8 but not RNF168 is a primary mediator of UBC13- 
dependent protein recruitment to DSB-flanking chromatin. a, Immunoblot 
analysis of HCT116 wild-type (WT) and UBC13-knockout (KO) cells. MCM6 
was a loading control. WCE, whole-cell extract. b-d, Representative images 
of HCT116 wild-type and UBC13-knockout cells exposed to IR or laser micro- 
irradiation (n = 3 experiments). Cells in d were transfected with the indicated 
expression constructs (Extended Data Fig. 1c). HA, haemagglutinin. 

e, f, Binding of recombinant wild-type and mutant (MUT) K63-Super-UIM to 


that these ubiquitylations were markedly upregulated after DSBs 
(Fig. 2b and Extended Data Fig. 5a, b). Basal levels of H1 K63 poly- 
ubiquitylation in non-irradiated cells dropped substantially upon 
serum starvation (Extended Data Fig. 5a), suggesting that they result 
mostly from endogenous DNA damage generated by cell-cycle- 
dependent processes. Little if any K63-linked ubiquitylation of core 
histones was detectable, regardless of whether DSBs had been inflicted 
or not (Extended Data Fig. 5c), thus H1 appears unique among his- 
tones in undergoing robust DSB-stimulated K63 ubiquitylation. 
Importantly, the DNA-damage-induced increase in K63-linked poly- 
ubiquitylation of H1 was dependent on RNF8 and UBC13 but not 
RNF168 (Fig. 2c, d), whereas none of these enzymes appreciably affec- 
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—Ub 


di-ubiquitin (Ub,) linkages. The migration of molecular weight markers (kDa) 
is indicated on the left. g, Quantification of GFP—K63-Super-UIM and 53BP1 
accumulation at DSBs in U2OS/GFP-K63-Super-UIM cells transfected with 
the indicated siRNAs. CTRL, control. Data are mean + standard deviation 
(s.d.) from three independent experiments. Representative images are shown 
in Extended Data Fig. 2b. Scale bars, 10 jim. a, f, Uncropped blots are shown in 
Supplementary Fig. 1. 


ted H1 monoubiquitylation (Extended Data Fig. 5d). Moreover, mass 
spectrometry analysis showed that H1 was a major factor co-purifying 
with endogenous RNF8 (Extended Data Fig. 5e). Finally, recombinant 
RNFS8 catalysed robust ubiquitylation of nucleosome-associated H1 
with UBC13 in vitro (Fig. 2e). These data suggest that histone H1 is a 
major chromatin-associated substrate of RNF8-UBC13. 

To address whether linker histones represent critical RNF8 sub- 
strates in DSB signalling, we downregulated overall H1 levels using a 
short interfering RNA (siRNA) cocktail targeting different isoforms 
(Extended Data Fig. 6a, b). Under these conditions, accumulation of 
K63-linked ubiquitin chains, RNF168, 53BP1 and BRCAI, but not 
MDC1, at DSB sites was clearly diminished (Fig. 3a-d and Extended 
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Figure 2 | H1-type linker histones are major chromatin-bound targets of 
RNF8-UBC13-dependent K63 ubiquitylation. a, Tukey boxplot showing 
levels of ubiquitin linkages in HCT116 UBC13-knockout (KO) cells relative to 
wild-type (WT) cells, quantified by SILAC-based proteomics. Data from two 
independent experiments are shown. **P < 0.01; NS, not significant (Welch’s 
t-test). b-d, Pull-down analysis of IR-induced K63-linked ubiquitylation of 
histone H1.2 in U2OS cells (b, d) or derivative cell lines expressing RNF8 or 
RNF168 short hairpin RNAs (shRNAs) in a doxycycline (DOX)-inducible 


Data Fig. 6c-g), suggesting that loss of H1 uncouples ubiquitin- 
dependent DSB signalling at the level of RNF168 accrual. 
Consistently, like RNF8 or RNF168 knockdown, H1 downregulation 
suppressed DSB-induced H2A ubiquitylation catalysed by RNF168 
(Fig. 3e and Extended Data Fig. 6h)*”°. As an alternative strategy to 
interfere with H1 functionality, we overexpressed the high-mobility 
group protein HMGB1, which competes with H1 for chromatin bind- 
ing in a non-site-specific manner”. Elevated HMGB1 levels impaired 
K63 ubiquitylation and 53BP1 accumulation at DSB sites despite 
enhancing y-H2AX formation, and this could be rescued by co-over- 
expression of H1 or by wild-type but not catalytically inactive RNF168 
(Fig. 3f, g and Extended Data Fig. 7a-c). A similar strong increase in 
-H2AX formation accompanied by only mildly elevated 53BP1 accu- 
mulation at DSBs, indicating a partial uncoupling of these events, was 
reported for murine cells lacking three of six H1 isoforms”*. Together, 
these data suggest that H1-type histones are functionally critical targets 


manner (c) using recombinant wild-type or mutant (MUT) K63-Super-UIM. 
Where indicated, ubiquitin conjugates were digested with recombinant USP2 
catalytic domain (USP2cc) before SDS-polyacrylamide gel electrophoresis 
(SDS-PAGE). Asterisk indicates a non-specific band. e, In vitro ubiquitylation 
of purified H1-containing oligonucleosomes by RNF835)_4g5 and indicated 
factors. WCE, whole-cell extract. The migration of molecular weight markers 
(kDa) is indicated on the left. b-e, Uncropped blots are shown in 
Supplementary Fig. 1. 


of RNF8- and UBC13-dependent K63 ubiquitylation at DSB-flanking 
chromatin areas. 

We asked whether K63-ubiquitylated H1 provides a platform for 
initial, RNF8-UBC13-dependent RNF168 recruitment to DSBs. 
RNF168 contains two DSB-targeting modules, both of which contain 
ubiquitin-binding domains**”’ (Fig. 4a). While the carboxy-terminal 
LRM2-MIU2 region, which we termed ubiquitin-dependent DSB 
recruitment module 2 (UDM2), recognizes RNF168-catalysed forms 
of ubiquitylated H2A, thereby effectively enabling RNF168 to autono- 
mously propagate H2A ubiquitylation at DSB sites once recruited, its 
LRM1-UMI-MIU1 region (UDM1) has been suggested to bind as yet 
undefined RNF8-generated ubiquitylation products required for initial 
relocalization of RNF168 to break sites (Fig. 4a)’. Therefore, if K63- 
ubiquitylated H1 functions as a recruitment platform for RNF168 at 
DSB-modified chromatin, the UDM1 module should be capable of 
recognizing this modified form of H1. Consistent with this idea, 


19 NOVEMBER 2015 | VOL 527 | NATURE | 391 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


U20S/GFP-K63-Super-UIM 


GFP-K63- -HOAK+ 
Super-UIM _53BP1 DAPI 


0 0 
XY os 
es s & 
s Se 
e 


+ Laser microirradiation 


° 
oe 


H® siCTRL 
siH1 


siCTRL 


GFP-K63-UIM stripe 


(% cells relative to siCTRL) 
yb £ BD w 
So 56 56 6 
DNA damage stripe intensity 
(relative to siCTRL) 
Py RB 2 we sd 
oS: oO: -6: "6 36) 
— 
cS) 
, 


siH1 


b + + siCTRL 
U20S/GFP-K63-Super-UIM WT + IR re 4 sit 
GFP-K63- + + IR 
Si -UIM -H2AX 53BP1 _ 
— - e “Ec «y-H2AX-Ub, (K119 and K13/K15) 
a = —y-H2AX-Ub; (K119 or K13/K15) 
= 5 |—y-H2AX 
a é 
° ig ee wee we —-H2AX (low exposure) 
= 
c a ee ee ee HH 2AX 
= 
m ee eee ICC 
f U20S + IR g 
HMGB1-GFP+ 
_ -GFP GFP-H1.2 GFP-H1.2 


1004 


rm 2 
oO BE 
x9 
bagi) 
oo 504 
62 
= a 
a a 
a an 
oO a8 
ite} gf 


+ GFP 
+ + HMGB1-GFP 
+ + GFP-H1.2 


y-H2AX 


Figure 3 | Histone H1 is required for K63-linked ubiquitylation and 
RNF168-dependent protein retention at DSB sites. a, b, Representative 
images of GFP-K63-Super-UIM-expressing cells transfected with siRNAs. 
CTRL, control; DAPI, 4',6-diamidino-2-phenylindole. c, Proportion of cells 
with visible accumulation of GFP-K63-Super-UIM at laser-induced DSBs 
(a). d, Intensity of GFP-K63-Super-UIM or 53BP1 accumulation at laser- 
induced DSBs normalized to y-H2AX signal (a). e, Analysis of IR-induced 
y-H2AX ubiquitylation (Ub) by RNF168 (marked by arrow) in chromatin 
fractions of U2OS cells transfected with the indicated siRNAs. f, Representative 
images of U2OS cells expressing the indicated constructs and exposed to IR. 
HMGB1 overexpression impairs 53BP1 recruitment to DSBs (marked by 
arrows). g, Quantification of data in f. Data in c and d are mean = s.d. from 
three independent experiments. Data in g are mean + standard error of the 
mean (s.e.m.) from two independent experiments. Scale bars, 10 fm. 

e, Uncropped blots are shown in Supplementary Fig. 1. 


we found that UDM1 bound to K63-linked ubiquitin but no other 
chain types (Fig. 4b). RNF168 UDM2 also interacted with K63 lin- 
kages, albeit with much lower affinity, and unlike UDM1 it also recog- 
nized other ubiquitin chains (Extended Data Fig. 8a, b). This might 
reflect the fact that UDM1, but not UDM2, has two adjacent ubiquitin- 
binding domains (UMI and MIU1), which together may confer specific 
binding to K63-linked chains. We also found that UDM1 interacted 
with histone H1 but not H2A in vitro, whereas UDM2 displayed the 
inverse preference, binding only to H2A, as previously shown!! 
(Fig. 4c). The LRM1 part of UDM1 has been hypothesized to impart 
target specificity to the ubiquitin-binding affinity of this module”. 
Consistently, the H1-binding ability of UDM1 was at least partially 
mediated by the LRM1 motif (Fig. 4a), which interacted strongly with 
H1 in vitro, unlike LRM2 (Extended Data Fig. 8c). However, additional 
sequence elements within UDM1 were also able to bind H1 (data not 
shown), possibly owing to the very acidic nature of the UDM1 region 
(Extended Data Fig. 8d), which could facilitate robust interaction with 
the highly basic H1 proteins”*. Unlike UDM1, UDM2 interacted 
with neither modified nor unmodified forms of H1 (Fig. 4c, d). 
When expressed in cells, the UDM1 domain efficiently bound to 
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Figure 4 | The RNF168 UDM1 domain is a high-affinity reader of K63- 
ubiquitylated H1. a, Composition and reported functions of ubiquitin- 
dependent DSB recruitment modules (UDMs) in human RNF168. b, Binding 
of recombinant UDM1 to di-ubiquitin (Ub,) linkages. c, Binding of 
recombinant UDM1 and UDM2 to purified histones (H1.0 and H24A). 

d, Pull-down assays of Strep-tagged UDM1 and UDM2 modules expressed in 
U20S cells. The migration of molecular weight markers (kDa) is indicated on 
the left. WCE, whole-cell extract. e, Representative images of shRNF168- 
expressing cells transfected with GFP-RNF168 constructs and exposed to IR 
(n = 2 experiments). Deletion of UDM1 (AUDM1) impairs restoration of 
53BP1 foci by GFP-RNF168 (indicated by arrows). Scale bar, 10 tum. f, Model 
of RNF8-UBC13 function in ubiquitin-dependent signalling after DSBs. Me, 
methyl group; P, phosphate. b-d, Uncropped blots are shown in 
Supplementary Fig. 1. 


high-molecular-weight forms of endogenous H1 isoforms in an IR- 
stimulated manner, suggesting that they correspond to K63-ubiquity- 
lated H1 (Fig. 4d and Extended Data Fig. 8e). Supporting this, point 
mutations in the UMI or MIU1 domains that impair their ubiquitin- 
binding activity and RNF168 recruitment to DSBs''”” markedly 
reduced the binding of UDM1 to modified H1 (Extended Data 
Fig. 8f). Moreover, UDM1 did not interact with ubiquitylated forms 
of core histones, in sharp contrast to its binding to ubiquitylated H1 
(Extended Data Fig. 9a). Finally, in-frame deletion of UDM1 but not 
UDM2 impaired the ability of ectopically expressed RNF168 to 
promote IR-induced 53BP1 foci in cells lacking endogenous 
RNF168 (Fig. 4e). We conclude that the RNF168 UDM1 is a high- 
affinity recognition module for K63-ubiquitylated histone H1. While 
the isolated UDM1 domain was distributed pan-cellularly and inter- 
acted with a range of proteins when overexpressed in cells, the only 
such protein showing DSB-stimulated K63 polyubiquitylation was an 
H1 isoform (Extended Data Fig. 9b, c), further suggesting that ubiqui- 
tylated H1 is a major receptor for RNF168 at DSB-modified chromatin. 

Collectively, our findings suggest an integrated model for how 
RNF8 and UBC13 promote ubiquitin-dependent protein recruitment 
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to DSB sites. We propose that H1-type linker histones represent key 
chromatin-associated RNF8 substrates whose UBC13-dependent 
K63-linked ubiquitylation at DSB-containing chromatin provide an 
initial binding platform for RNF168 via the UDM1 (Fig. 4f). RNF168 
then ubiquitylates H2A at K13/K15 and possibly other proteins to 
trigger recruitment of DSB repair factors. The notable absence of 
K63-linked ubiquitin chains on H2A suggests that RNF168 does not 
efficiently modify H2A in conjunction with UBC13, but may instead 
catalyse formation of other ubiquitin chains or monoubiquitylation of 
the H2A K13/K15 mark. Indeed, RNF168 was recently shown to cata- 
lyse the formation of K27-linked ubiquitin chains, at least when over- 
expressed”. It is possible, however, that RNF168 cooperates with 
UBC13 in K63-linked ubiquitylation of other factors at damaged chro- 
matin. While we observed no marked DSB-induced change in overall 
nuclear H1 mobility and no overt enrichment or depletion of H1 iso- 
forms at DSB sites, the K63-ubiquitylated forms of H1 proteins were 
more loosely associated with chromatin than their unmodified coun- 
terparts (Extended Data Fig. 10a, b; data not shown). This suggests that 
in addition to triggering RNF168 recruitment, the DSB-associated K63 
ubiquitylation of H1 may play a part in facilitating chromatin remod- 
elling to allow efficient repair. 

H1-type histones consist of almost 30% lysine residues”, and pro- 
teomic studies have shown that many of these can be ubiquity- 
lated”**'”°. Because UBC13 only generates few de novo ubiquitylation 
marks, it is conceivable that RNF8-UBC13 mainly extends pre-existing 
H1 ubiquitylations in response to DNA damage, rendering identifica- 
tion of the key H1 sites targeted by RNF8-UBC13-dependent K63 
ubiquitylation challenging. Although numerous post-translational 
modifications (PTMs) have been mapped on different H1 isoforms”, 
these marks have not so far been connected directly to the ‘histone code’ 
specifying protein recruitment to different chromatin states'*’’. In 
addition to providing the first insights into the functional role of H1 
ubiquitylation, our findings also show that linker histones can indeed 
form part of a dynamic histone code for DNA repair, wherein RNF8 
and RNF168 function as writer and reader, respectively, of K63- 
ubiquitylated H1. The multitude of site-specific H1 PTMs raises the 
possibility that some of these marks may have an integral role in the 
histone code, an area of study warranting further investigation. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Plasmids. cDNAs encoding K63-Super-UIM (wild type and mutant) and the 
Vps27-based K63 binder'* containing C-terminal His, tags were produced as 
synthetic genes (Eurofins) and inserted into pDONR221 by BP reactions 
(Invitrogen). By means of LR reactions (Invitrogen) the inserts were then trans- 
ferred to the Champion pET104 BioEase Gateway Biotinylation System 
(Invitrogen) for recombinant protein production or pcDNA-DEST53 
(Invitrogen) for GFP-tagged constitutive mammalian expression. For inducible 
expression of GFP-tagged K63-Super-UIM, the GFP-K63-Super-UIM comple- 
mentary DNA was inserted into pcDNA4/TO (Invitrogen). Plasmids encoding 
HA-tagged wild-type and catalytically inactive (CI) RNF8 (C403S), wild-type and 
CI (C16S/C19S) forms of RNF168, and UBC13, as well as chimaeras between 
RNFS8 and different E2 enzymes were described previously’*'*. The *FHA muta- 
tion (R42A) in HA~-RNF8AR-UBC13 was generated by site-directed mutagenesis. 
RNF8 constructs were made resistant to RNF8-siRNA by introducing three 
silent mutations (bold) in the siRNA targeting sequence (5’-TGCGGAGTA 
TGAGTACGAG-3’) in the plasmids by site-directed mutagenesis. The RNF168 
UDM1 (amino acids 110-201) and UDM2 (amino acids 419-487) fragments were 
amplified by PCR and inserted into either pTriEx-5 (Novagen) for Strep- and His- 
tagged expression in Escherichia coli and mammalian cells, or pEGFP-Cl 
(Clontech) for expression of GFP-tagged versions. The Strep-RNF168 UDM1 
mutants used in this study (*UMI (L149A) and *MIU1 (A179G)) were generated 
using the QuikChange site-directed mutagenesis kit (Stratagene). Constructs 
encoding GFP-H1 isoforms were cloned by inserting the respective cDNAs into 
the BglII and BamHI sites of pEGFP-C1 (Clontech). A plasmid encoding 
HMGB1-GFP was provided by M. Bianchi. A Flag-HMGB1 expression construct 
was generated by inserting the HMGB1 open reading frame (ORF) into pFlag- 
CMV2 (Sigma). All constructs were verified by sequencing. Plasmid transfections 
were done with FuGene 6 (Promega) or Genejuice (Novagene), siRNA transfec- 
tions were done with Lipofectamine RNAiMAX (Invitrogen), according to the 
manufacturers’ instructions. 

siRNA. siRNA sequences used in this study were as follows. Non-targeting control 
(CTRL), 5’-GGGAUACCUAGACGUUCUATT-3’; UBC13, 5’-GAGCAU 
GGACUAGGCUAUATT-3’; RNF8, 5’-UGCGGAGUAUGAAUAUGAATT-3’; 
RNF168, 5'-GUGGAACUGUGGACGAUAATT-3’ or 5'-GGCGAAGAGCG 
AUGGAAGATT-3’; histone H1(#1), 5’-GCUACGACGUGGAGAAGAATT-3’; 
H1(#2), 5’-GCUCCUUUAAACUCAACAATT-3';  H1(#3),  5’-GAAGCC 
AAGCCCAAGGUUATT-3’; H1(#4), 5'-CCUUUAAACUCAACAAGAATT- 
3’; H1(#5), 5'-CCUUCAAACUCAACAAGAATT-3'; H1(#6), 5’-UCAAGAG 
CCUGGUGAGCAATT-3’; H1(#7), 5'-GGACCAAGAAAGUGGCCAATT-3’; 
H1(#8), 5'-GCAUCAAGCUGGGUCUCAATT-3’; H1(#9), 5’-CAGUGAAA 
CCCAAAGCAAATT-3’; H1(#10) (specific for H1x), 5’-CCUUCAAGCUC 
AACCGCAATT-3'; 53BP1, 5’-GAACGAGGAGACGGUAAUATT-3'; USP7, 
5'-GGCGAAGUUUUAAAUGUAUTT-3’; and USP9x, 5'-GCAGUGAGUGG 
CUGGAAGUTT-3’. 

Cell culture. Human U20S, HCT116 and RPE1 cells were obtained from ATCC. 
U20S and HCT116 were cultured in DMEM containing 10% FBS and 1 Xpeni- 
cillin-streptomycin, while RPE1 cells were grown in a 1:1 mixture of Ham’s F12 
and DMEM supplemented with 10% FBS and 1 X penicillin-streptomycin. Serum- 
starvation of RPE] cells was done by incubating cells for 24 h in medium supple- 
mented with 0.25% FBS. A HCT116 UBC13-knockout cell line was generated 
using CRISPR-Cas9 technology'*”’. A donor plasmid bearing a splice acceptor 
site and a puromycin resistance marker, flanked by homology arms, was co- 
transfected with pX300 (ref. 14) targeting the GGCGCGCGGGAATCGCGGCG 
sequence within the first intron of the UBC13 gene. To generate cell lines capable 
of doxycycline-induced expression of GFP-tagged K63-Super-UIM, U20S cells 
were transfected with GFP-K63-Super-UIM plasmid and pcDNA6/TR and pos- 
itive clones were selected with Zeocin (Invitrogen) and Blasticidin S (Invitrogen). 
Stable U2OS cell lines expressing RNF8 or RNF168 shRNA in a doxycycline- 
inducible manner or Strep-HA-ubiquitin were described previously'**'. All cell 
lines were regularly tested for mycoplasma infection. Unless otherwise indicated, 
cells were exposed to DSBs using IR (4 Gy for microscopy experiments and 10 Gy 
for biochemical analyses) or laser micro-irradiation (as described previously*), 
and collected 1 h later. 

Recombinant protein production. Purified biotinylated K63-Super-UIM wild- 
type and mutant proteins containing an N-terminal, biotinylated BioEase tag and 
a C-terminal His¢-tag were obtained by expressing the proteins in an E. coli strain 
expressing the BirA biotin ligase. Bacteria were grown in LB medium containing 
0.5 mM biotin, induced with 0.25 mM isopropyl-f-p-thiogalactoside (IPTG) for 
3 hat 30 °C, and then lysed by French press. The K63-Super-UIM constructs were 
purified using immobilized metal affinity chromatography (IMAC) followed by 
size-exclusion chromatography (SEC). Purity and complete biotinylation of the 
proteins was verified by mass spectrometry. Recombinant Strep—Hiss—RNF168 


UDM.-1/2 was produced in Rosetta2(DE3)pLacl (Novagen) bacteria induced with 
0.5mM IPTG for 3h at 30°C, lysed using Bugbuster (Novagen) supplemented 
with Protease Inhibitor Cocktail without EDTA (Roche). The proteins were puri- 
fied on Ni**-NTA-agarose (Qiagen). Recombinant human UBA1, UBCH5c, 
UBC13, MMS2, RNF8 and ubiquitin used for in vitro ubiquitylation assays were 
purified as described®. 

Antibodies. Antibodies used in this study included: UBC13 (#4919, Cell 
Signaling), MCM6 (sc-9843, Santa Cruz), 53BP1 (sc-22760, Santa Cruz), 
y-H2A.X (05-636, Millipore; or 2577, Cell Signaling), H2A.X (2595, Cell 
Signaling), MDC1 (ab11171, Abcam), conjugated ubiquitin (FK2) (BML- 
PW$8810-0500, Enzo Life Sciences), HA (11867423991, Roche; and sc-7392, 
Santa Cruz), Myc (sc-40, Santa Cruz), Hiss (631212, Clontech), GFP (sc-9996, 
Santa Cruz; 11814460001, Roche), ubiquitin (sc-8017, Santa Cruz), histone H1.2 
(ab17677, Abcam), histone H1x (A304-604A, Bethyl Labs), histone H1 (pan, #AE- 
4 clone) (ab7789, Abcam), histone H2A (07-146, Millipore), histone H2B (ab1790, 
Abcam), histone H3 (ab1791, Abcam), histone H4 (ab7311, Abcam), cyclin A 
(sc-751, Santa Cruz), actin (MAB1501, Millipore), BRCA1 (sc-6954, Santa 
Cruz), RNF168 for immunofluorescence (06-1130, Millipore) and antibody to 
RNF168 (ref. 5) used for immunoblots were gifts from D. Durocher. Antibody 
to RNF8 has been described previously’. 

Immunochemical methods. For pull-down of K63-ubiquitylated proteins, cells 
were lysed in high-stringency buffer (50 mM Tris, pH 7.5; 500 mM NaCl; 5mM 
EDTA; 1% NP40; 1 mM dithiothreitol (DTT); 0.1% SDS) containing 1.25 mg ml t 
N-ethylmaleimide, 50 uM DUB inhibitor PR619 (LifeSensors), and protease inhib- 
itor cocktail (Roche). Recombinant biotionylated K63-Super-UIM (25 pg ml ') 
was added immediately upon lysis, followed by sonication and centrifugation. 
Streptavidin M-280 Dynabeads (Invitrogen) was added to immobilize the K63- 
Super-UIM, and bound material was washed extensively in high-stringency buffer. 
A Benzonase (Sigma) and MNase (NEB) treatment step was included to remove 
any contaminating nucleotides. Proteins were resolved by SDS-PAGE and ana- 
lysed by immunoblotting. Where indicated, bound complexes were subjected to 
deubiquitylation by incubation with USP2cc (11M, Boston Biochem) in DUB 
buffer (50 mM HEPES, pH 7.5; 100mM NaCl; 1mM MnCl; 0.01% Brij-35; 
2mM DTT) overnight at 30°C before boiling in Laemmli Sample Buffer. 
Immunoblotting, Strep-Tactin pull-downs, and chromatin enrichment were done 
essentially as described’. Briefly, Strep-RNF168 UDM pull-down experiments 
from cells were performed after lysing cells in EBC buffer (50mM Tris, pH 7.4; 
150 mM NaCl; 0.5% NP-40; 1 mM EDTA) containing 1.25 mg ml! NEM, 50 pM 
PR619 (LifeSensors) and protease inhibitor cocktail (Roche). The soluble fraction 
was subsequently used for immunoprecipitation using Strep-Tactin sepharose 
(IBA). After washing in EBC buffer, proteins were eluted and analysed by immu- 
noblotting. To isolate Strep-HA-ubiquitin-conjugated proteins, cells were lysed in 
denaturing buffer (20 mM Tris, pH 7.5; 50 mM NaCl; 1 mM EDTA; 1 mM DTT; 
0.5% NP-40; 0.5% sodium deoxycholate; 0.5% SDS) containing 1.25 mg ml! 
NEM, 50 tM PR619 (LifeSensors) and protease inhibitor cocktail (Roche). After 
sonication and centrifugation, Strep-HA-ubiquitin-conjugated proteins were 
immobilized on Strep-Tactin sepharose (IBA). After extensive washing in dena- 
turing buffer, proteins were eluted and analysed by immunoblotting. For chromatin 
fractionation, cells were first lysed in buffer 1 (100 mM NaCl; 300 mM sucrose; 
3mM MgCl; 10 mM PIPES, pH 6.8; 1 mM EGTA; 0.2% Triton X-100) containing 
protease, phosphatase and DUB inhibitors and incubated on ice for 5 min. After 
centrifugation, the soluble proteins were removed and the pellet was resuspended in 
buffer 2 (50 mM Tris-HCl, pH 7.5; 150 mM NaCl; 5 mM EDTA; 1% Triton X-100; 
0.1% SDS) containing protease, phosphatase and DUB inhibitors. Lysates were 
then incubated 10 min on ice, sonicated, and solubilized chromatin-enriched frac- 
tions were collected after centrifugation. 

Immunofluorescence staining and microscopy. For immunofluorescence stain- 
ing, cells were fixed in 4% paraformaldehyde for 15 min, permeabilized with PBS 
containing 0.2% Triton X-100 for 5 min, and incubated with primary antibodies 
diluted in DMEM for 1 h at room temperature. After staining with secondary 
antibodies (Alexa Fluor; Life Technologies) for 1h, coverslips were mounted in 
Vectashield mounting medium (Vector Laboratories) containing nuclear stain 
DAPI. Images of GFP-K63-Super-UIM were all obtained from a stable cell line 
where GFP-K63-Super-UIM was induced by incubating with 1 pg ml~* doxycy- 
cline for approximately 24 h unless otherwise stated. Images were acquired with an 
LSM 780 confocal microscope (Carl Zeiss Microimaging) mounted on Zeiss- 
Axiovert 100M equipped with Plan-Apochromat 40X/1.3 oil immersion object- 
ive, using standard settings. Image acquisition and analysis was carried out with 
ZEN2010 software. For ImageJ-based image analysis, images were acquired with 
an AF6000 wide-field microscope (Leica Microsystems) equipped with a Plan- 
Apochromat 40X/0.85 CORR objective, using the same microscopic settings. 
Fluorescence intensities of the micro-irradiated region (demarcated by y-H2AX 
positivity) and the nucleus were first corrected for the general image background. 
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Using these values, relative recruitment to DNA damage sites (relative fluor- 
escence units (RFUs)) was calculated by normalizing the nuclear-background- 
corrected signal at the micro-irradiated region to that of the nuclear background. 
Finally, the RFU of the protein of interest was normalized to the RFU of the 
y-H2AX signal and plotted as the average of biological triplicates. Fluorescense 
recovery after photobleaching (FRAP) was performed essentially as described”’. 
Briefly, U2OS cells stably expressing GFP-H1 were grown in glass-bottom dishes 
(LabTek) in the presence of CO,-independent medium. A 2-m-wide rectangular 
strip spanning the entire width of the cell was bleached by excitation with the 
maximal intensity of a 488 nm laser line, after which 95 frames of the bleached 
region were acquired at 4 s intervals. Mean fluorescence intensities were processed, 
normalized and analysed as described”’. 

In vitro binding and ubiquitylation assays. Binding of K63-Super-UIM to di- 
ubiquitin (Ub2) linkages (Boston Biochem) was done by incubating 100 ng Ub2 
with 2.5 ug K63-Super-UIM immobilized on Streptavidin M-280 Dynabeads 
(Invitrogen) in buffer A (50mM Tris, pH 7.5; 10% glycerol; 400mM NaCl; 
0.5% NP40; 2mM DTT; 0.1 mg ml! BSA). After extensive washing, bound 
complexes were resolved by SDS-PAGE and analysed by immunoblotting. 
Binding of RNF168 UDM1/2 to di-ubiquitin (Ub) linkages was analysed by 
incubating 100 ng Ub, with 51g Strep-RNF168-UDM1/2 immobilized on 
Strep-Tactin sepharose (IBA BioTAGnology) in buffer B (50mM Tris, pH 8; 
5% glycerol; 0.5% NP40; 2 mM DTT; 0.1 mg ml — 'BSA;2mM MgCl), supplemen- 
ted with 250 mM KC] for UDM1 binding and 100 mM KCl for UDM2 binding). 
After extensive washing, bound complexes were resolved by SDS-PAGE and 
analysed by immunoblotting. Where indicated, UDM1/2 binding to K63-linked 
Ub» was analysed in the presence of increasing KCl concentrations (75 mM, 
150 mM and 250 mM). To analyse binding of RNF168 UDM1/2 to recombinant 
histones, purified Strep-RNF168 UDM1/2 (10 1g) was pre-bound to Strep-Tactin 
sepharose in buffer C (for binding to H1.0) (50mM, Tris pH 8; 5% glycerol; 
150mM KC]; 0.5% NP40; 2mM DTT; 0.1 mg ml! BSA) or D (for binding to 
H2A) (50 mM, Tris pH 8; 5% glycerol; 75mM KCl; 0.05% NP40; 2mM DTT; 
0.1 mg ml’ BSA), and incubated with 500 ng recombinant histone H1.0 or H2A 
(New England Biolabs). Bound complexes were washed and analysed by immu- 
noblotting. To analyse binding of LRM1 and LRM2 peptides to histone H1.0 or 
H2A, magnetic Streptavidin beads were incubated with buffer E (25 mM, Tris pH 
8.5; 5% glycerol; 50 mM KCI; 0.5% TX-100; 1 mM DTT; 0.1 mg ml! BSA) in the 
absence (control) or presence of 1.5 41g purified, biotinylated RNF168 LRM1 
(amino acids 110-133) or LRM2 (amino acids 463-485) peptide. Samples were 
then incubated with 250 ng recombinant H2A or H1.0 for 2h at 4 °C, and immo- 
bilized complexes were washed and analysed by SDS-PAGE and Colloidal Blue 
staining (Invitrogen). 

For in vitro ubiquitylation assays, histone-H1-containing oligonucleosomes 
(10 1M) were purified in the presence of 55mM iodoacetamide, essentially as 
described previously”, with the exception that micrococcal nuclease digestion was 
stopped with 20 mM EGTA and dialysis was started right after the second homo- 
genization in buffer containing 50 mM Tris, pH 7.5; 150 mM NaCl; 1 mM TCEP; 
and 340 mM sucrose. Dialysed samples were then incubated with DUB inhibitor 
(Ubiquitin-PA**, 20 1M) for 20 min at room temperature. Nuclesomes were incu- 
bated with 0.5 uM human UBA1, 5 14M UBCH5c, 1 4M UBC13-MMS2 complex, 
5 uM RNF835) _4g5 fragment (purified as described previously*) and 75 UM ubi- 
quitin in reaction buffer (50 mM Tris, pH 7.5; 100 mM NaCl; 3mM ATP; 3mM 
MgCl; 1 mM TCEP) at 31°C. Samples were analysed by immunoblot analysis. 
SILAC-based quantification of di-glycine-containing peptides. For SILAC 
experiments, U2OS or HCT116 cells were grown in medium containing unla- 
belled L-arginine and t-lysine (Arg’/Lys’) as the light condition, or isotope- 
labelled variants of L-arginine and t-lysine (Arg®/Lys* or Arg’'°/Lys®) as the heavy 
condition’®. SILAC-labelled HCT116 wild-type and UBC13-knockout cells were 
lysed in modified RIPA buffer (50mM Tris-HCl, pH 7.5; 150mM NaCl; 1% 
Nonidet P-40; 0.1% sodium-deoxycholate; 1mM EDTA) supplemented with 
protease inhibitors (complete protease inhibitor mixture tablets, Roche 
Diagnostics) and N-ethylmaleimide (5 mM). Lysates were incubated for 10 min 
on ice and cleared by centrifugation at 16,000g. An equal amount of protein from 
the two SILAC states was mixed and precipitated by adding fivefold acetone and 
incubating at —20 °C overnight. Precipitated proteins were dissolved in denatur- 
ing buffer (6 M urea; 2 M thiourea; 10 mM HEPES, pH 8.0), reduced with DTT 
(1 mM) and alkylated with chloroacetamide (5.5 mM). Proteins were digested with 
lysyl endoproteinase C (Lys-C) for 6 h, diluted fourfold with water and digested 
overnight with trypsin. The digestion was stopped by addition of trifluoroacetic 
acid (0.5% final concentration), incubated at 4°C for 2h and centrifuged for 
15 min at 4,000g. Peptides from the cleared solution were purified by reversed- 
phase Sep-Pak C18 cartridges (Waters Corporation). Diglycine-lysine modified 
peptides were enriched using the Ubiquitin Remnant Motif Kit (Cell Signaling 
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Technology), according to the manufacturer’s intructions. Briefly, peptides were 
eluted from the Sep-Pak C18 cartridges with 50% acetonitrile, which was subse- 
quently removed by centrifugal evaporation. Peptides were incubated with 40 ul of 
anti-di-glycine-lysine antibody resin in immunoaffinity purification (IAP) buffer 
for 4h at 4°C. Beads were washed three times with IAP buffer, two times with 
water and peptides eluted with 0.15% trifluoroacetic acid. Eluted peptides were 
fractionated by microcolumn-based strong cation exchange chromatography 
(SCX) and cleaned by reversed-phase C18 stage-tips. 

SILAC K63-Super-UIM pull-down and in-gel digestion. SILAC-labelled cells 
were lysed in high-stringency RIPA buffer (50mM Tris-HCl, pH 7.5; 500 mM 
NaCl; 1% Nonidet P-40; 0.1% sodium-deoxycholate; 1 mM EDTA) containing 
1.25 mg ml? N-ethylmaleimide, 50 uM DUB inhibitor PR619 (LifeSensors), 
and protease inhibitor cocktail (Roche). Lysates from different SILAC states were 
separately incubated for 10 min on ice and cleared by centrifugation at 16,000g. 
Extracts (5 mg) were incubated for 4h at 4 °C with K63-Super-UIM immobilized 
to Streptavidin beads (approximately 5 1g K63-Super-UIM per experiment). 
Beads were washed three times with high-stringency RIPA, beads from the dif- 
ferent SILAC conditions were mixed, and proteins were eluted with SDS sample 
buffer, incubated with DTT (10mM) for 10 min at 70°C and alkylated with 
chloroacetamide (5.5mM) for 60min at 25°C. Proteins were separated by 
SDS-PAGE using a 4-12% gradient gel and visualized with colloidal blue stain. 
Gel lanes were sliced into six pieces, and proteins were digested in-gel using 
standard methods”. 

Mass spectrometry and data analysis. Peptides were analysed on a quadrupole 
Orbitrap (Q Exactive, Thermo Scientific) mass spectrometer equipped with a 
nanoflow HPLC system (Thermo Scientific). Peptide samples were loaded onto 
C18 reversed-phase columns and eluted with a linear gradient (1-2 h for in-gel 
samples, and 3-4h for di-glycine-lysine enriched samples) from 8 to 40% acet- 
onitrile containing 0.5% acetic acid. The mass spectrometer was operated in a 
data-dependent mode automatically switching between MS and MS/MS. Survey 
full scan MS spectra (m/z 300-1200) were acquired in the Orbitrap mass analyser. 
The 10 most intense ions were sequentially isolated and fragmented by higher- 
energy C-trap dissociation (HCD). Peptides with unassigned charge states, as well 
as peptides with charge state less than +2 for in-gel samples and +3 for di-glycine- 
lysine enriched samples were excluded from fragmentation. Fragment spectra 
were acquired in the Orbitrap mass analyser. Raw MS data were analysed using 
MaxQuant software (version 1.3.9.21). Parent ion and tandem mass spectra were 
searched against protein sequences from the UniProt knowledge database using 
the Andromeda search engine. Spectra were searched with a mass tolerance of 
6 ppm in the MS mode, 20 ppm for MS/MS mode, strict trypsin specificity and 
allowing up to two missed cleavage sites. Cysteine carbamidomethylation was 
searched as a fixed modification, whereas amino-terminal protein acetylation, 
methionine oxidation and N-ethylmaleimide modification of cysteines, and di- 
glycine-lysine were searched as variable modifications. Di-glycine-lysines were 
required to be located internally in the peptide sequence. Site localization prob- 
abilities were determined using MaxQuant (PTM scoring algorithm) as described 
previously’*. A false discovery rate of less than 1% was achieved using the target- 
decoy search strategy” and a posterior error probability filter. Information about 
previously known protein-protein interactions among putative UBC13-depend- 
ent K63-Super-UIM interacting proteins was extracted using the HIPPIE data- 
base“ (version 1.6), and interactions were visualized in Cytoscape*'. The Gene 
Ontology (GO) biological process term analysis for UBC13-dependent K63- 
Super-UIM interacting proteins was filtered for categories annotated with at least 
20 and not more than 300 genes. Redundant GO terms (less than 30% unique 
positive-scoring genes compared to more significant GO term) were removed and 
the five most significant (Fisher’s exact t-test) remaining GO term categories 
depicted. To determine the variation within the quantification of ubiquitin linkage 
types, an F-test was performed and the P values were adjusted using the Bonferroni 
method. A significant difference in the variances between K48 and K11, and K48 
and K6 ubiquitin linkages was detected. To test the significance of the difference 
between the SILAC ratios measured for ubiquitin linkage types, the Welch two- 
sample t-test was performed and the obtained P values were adjusted using the 
Bonferroni method. 
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Extended Data Figure 1 | Protein recruitment to DSB sites in UBC13- siRNAs and subsequently with plasmids encoding UBC13 or siRNA-resistant 
knockout cells. a, b, Representative images of HCT116 wild-type (WT) or UBC13-RNFS8 fusion constructs (c) (n = 2). e, Representative images of 
UBC13-knockout (KO) cells exposed to IR. Where indicated, cells were UBC13-knockout cells transfected with plasmids encoding wild-type or 
transfected with HA-RNF8 plasmid before IR (n = 2 experiments). catalytically inactive (CI) HA-RNF168 (n = 2). Expression of HA- 
c, Constructs used in Fig. 1d and their ability to restore IR-induced 53BP1 foci RNF168 wild type restores IR-induced 53BP1 foci formation (arrows). Scale 
in UBC13-knockout cells. d, Representative images of HCT116 UBC13- bars, 10 um. 
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Extended Data Figure 2 | RNF8- and UBC13-dependent K63-linked 
ubiquitylation at DSB sites. a—c, Representative images of U2OS cells stably 
expressing GFP-K63-Super-UIM, transfected with siRNAs where indicated, 
and exposed to laser micro-irradiation or IR (n = 3). d, Representative images 
of U2OS cells stably expressing GFP- and nuclear localization signal (NLS)- 
tagged Vps27-UIM (with high affinity for binding to K63-linked ubiquitin'*) 
transfected with the indicated siRNAs and exposed to laser micro-irradiation 
(n = 2). e, Loss of RNF8 or UBC13 has no impact on 53BP1 abundance. 
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Immunoblot analysis of whole-cell extracts (WCEs) of U2OS cells transfected 
with indicated siRNAs. f, RNF8, but not RNF168, is modified by K63-linked 
ubiquitin chains. K63-Super-UIM pull-downs from U20S cells transfected 
with empty vector (—), HA-tagged RNF8 (wild type (WT) or catalytically 
inactive (CI)) or HA-RNF168 plasmids were immunoblotted (IB) with 
indicated antibodies. Scale bars, 10 jum. e, f, Uncropped blots are shown in 
Supplementary Fig. 1. 
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Extended Data Figure 3 | Experimental replicates of SILAC-based 
quantification of di-glycine-containing peptides and K63-Super-UIM pull- 
downs from wild-type and UBC13-knockout cells. a, Schematic outline of 
SILAC-based mass spectrometry approach to quantify di-glycine-containing 
peptides in HCT116 wild-type (WT) and UBC13-knockout (KO) cells. 

b, c, Proportional Venn diagrams showing overlap between all identified di- 
glycine-containing peptides (b) and those with a SILAC ratio (UBC13- 
knockout/wild-type cells) <0.5 (c) in two independent experiments (Exp.) 
performed as shown in a (Supplementary Table 1). d, Scatter plot showing 
correlation between SILAC ratios of di-glycine-containing peptides. The 
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Pearson’s correlation coefficient (R) is indicated. e, Schematic outline of 
SILAC-based mass spectrometry approach to identify UBC13-dependent 
K63-ubiquitylation targets in unperturbed HCT116 wild-type and UBC13- 
knockout cells. f, g, Proportional Venn diagrams showing overlap between all 
proteins identified in K63-Super-UIM pull-downs (f) and those with a SILAC 
ratio (UBC13-knockout/wild-type cells) <0.5 (g) in two independent 
experiments performed as shown in e (Supplementary Table 2). h, Scatter plot 
showing correlation between SILAC ratios of proteins identified in two 
experiments. The Pearson’s correlation coefficient (R) is indicated. 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


a S10047 
SLG7A11 — TALBO1 
obs rugs Mage 
ri; NTHROY_ oD 
PNP DSC3 6. ARG1 
PI 
ee ch RABIIA ee udiits  OTSD DNAlicr 
RPN2 
POs FIsi ANKA2 TXN GRE? ATP6VOA1 GH 
Hilts PHB2 MARCHS ~*~ HAX1 2 i ATP6VOD1 al 
ATP6V1B2 1 
ABHDIGA eiB31 pzips ZNFF1 ATBYIA ATP6VIC1 DE 
YBX1 BERK ATPEVIE1 
e DiABLO qe Be OP nae x ATPEV1G1 
NADSYN' UBB “% o as sTuBt ATPHA1 LGALs3 L@Aust 
2 RNF216 TAAF2 SS. cB 
EPBAILS FPN sedis RE op HP are APEH 
Pp -SMADS yar 1KBRG Pa OSBPL10 
crilyo: seve ee eae © 
RALA REPS! di CAMK2D EPSi AP2A1 pPYS\2 
CAN KAT2 GoLGas nie 
"Og ToERG: "1B 
6 
or" IGHG1 UBXNG Pd sy @ TIMM8B. TIMM13. TIMMBA 
LYN IGKC cay STXS\iB2 PolRes 
SHOBF! ADAM15 — MCL1 DYNGILIT — MiB+ Gi FOPNL pike 0uGAP4 DL: 
DAD 1PRKGDEP ITPRS anil cA TU Aah 
TRIM4 
b GO biological process 
ATP hydrolysis coupled protein transport 
Protein K63-linked ubiquitylation 
Positive regulation of 
I-KB kinase/NF-kB signaling 
Protein K48-linked ubiquitylation 
Vesicle docking 
0.0 25 5.0 75 10.0 
— log10(P value) 
c d 
Light (L) SILAC Heavy (H) SILAC ee SILAC ratio 
(Arg°/Lys°) (Arg’/Lys®) (H/L) 
Goes eee 
leencaaal TRAF6 3.41 
Mock IR 
- : TRAF2 2.91 
HP1BP3 2.62 
. Histone H1.2 1.85 
— tease Histone H1x 1.66 
Histone H1.0 1.40 
Pull-down: K63-Super-UIM CENPF 1.36 
SUMO1 1.36 
In-gel digestion Histone H2A 1.10 
Histone H3 1.09 
LC-MS/MS analysis Histone H2B (0.96 
Histone H4 0.88 


Extended Data Figure 4 | Analysis of UBC13-dependent K63-ubiquitylated 
proteins in unperturbed cells and in response to DNA damage. a, K63 
linkages from extracts of HCT116 wild-type (WT) and UBC13-knockout (KO) 
cells were enriched by K63-Super-UIM pull-down (Supplementary Table 2). 
Interaction network shows proteins enriched at least twofold in wild-type cells. 
Proteins involved in UBC13-dependent activation of NF-«B signalling are 


highlighted in red. b, Functional annotation of potential UBC13-dependent 
K63-ubiquitylated proteins (a), showing enriched Gene Ontology (GO) 
biological process terms. c, Schematic outline of SILAC-based mass 
spectrometry approach to identify targets of K63 ubiquitylation in response 
to IR-induced DSBs. d, SILAC ratios of selected proteins from U2OS cells 
treated as in c. Data from a representative experiment are shown (n = 3). 
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Extended Data Figure 5 | DSB-induced K63 ubiquitylation of H1-type 
linker histones. a, Analysis of K63-linked ubiquitylation of histone H1.2 in 
RPEI cells growing exponentially or kept quiescent by serum starvation. 

b, c, K63-Super-UIM pull-downs from U20S cells exposed or not to IR were 
immunoblotted (IB) with the indicated antibodies. d, U2OS cells or U2OS cells 
stably expressing Strep-HA-ubiquitin (U2OS/Strep-Ub) were transfected 
with the indicated siRNAs and exposed or not to IR. Whole-cell extracts 
(WCEs) and Strep-ubiquitin-conjugated proteins immobilized on Strep- 
Tactin beads under denaturing conditions were analysed by immunoblotting. 
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e, Proteins interacting with endogenous RNF8. U2OS cells stably expressing 
RNE8 shRNA in a doxycycline (DOX)-inducible manner’ was grown in 

light (L) or heavy (H) SILAC medium. Cells growing in light medium were 
induced to express RNF8 shRNA by treatment with DOX. Both cultures were 
then exposed to IR and processed for immunoprecipitation with RNF8 
antibody. Bound proteins were analysed by mass spectrometry. Proteins 
displaying the highest H/L SILAC ratios are listed. a-d, The migration of 
molecular weight markers (kDa) is indicated on the left. Uncropped blots are 
shown in Supplementary Fig. 1. 
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Extended Data Figure 6 | Knockdown of H1-type histones impairs 
accumulation of K63-linked ubiquitin conjugates, RNF168 and BRCA1 at 
DSB sites. a, Immunoblot analysis of U2OS cells transfected sequentially with 
plasmids encoding GFP-tagged histone H1 isoforms (H1.0-H1.5) and the 
indicated siRNAs. Knockdown efficiency of the siRNAs (#1, #4, #5 and #9) used 


in the pan-H1 siRNA cocktail to reduce global H1 expression level is 
highlighted in red boxes. b, Immunoblot analysis of U2OS cells transfected with 
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H1 siRNA cocktail (a). c-g, Representative images of siRNA-transfected 
U20S/GFP-K63-Super UIM (c) or U20S cells (d-g) exposed to IR or laser 
micro-irradiation (n = 2). h, Analysis of IR-induced y-H2AX ubiquitylation 
(Ub) by RNF168 (marked by arrow) in chromatin fractions of U2OS cells 
transfected with indicated siRNAs. Scale bars, 10 jim. a, b, h, Uncropped blots 
are shown in Supplementary Fig. 1. 
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Extended Data Figure 7 | HMGB1 overexpression impairs the RNF8/ (n = 3). Arrows indicate cells expressing HA-RNF168 CI, in which 53BP1 foci 
RNF168-dependent signalling response at DSB sites at the level of K63 formation is not restored. c, Representative images of U20S/GFP-K63-Super- 
ubiquitylation and RNF168 recruitment. a, b, Representative images of UIM cells transfected with Flag~-HMGBI construct and subjected to laser 
U20S cells co-transfected with constructs encoding HMGB1-GFP and micro-irradiation (n = 3). Flag-HMGB1-expressing cells show reduced K63 


wild-type (WT) or catalytically inactive (CI) HA-RNF168 and exposed toIR —_ ubiquitylation at DSB sites (indicated by arrows). Scale bars, 10 jim. 
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Extended Data Figure 8 | Interaction of RNF168 UDM1 with K63- 
ubiquitylated H1. a, Immunoblot analysis of immobilized recombinant 
RNF168 UDM1 or UDM2 incubated with K63 linked di-ubiquitin (Ub>) in the 
presence of increasing salt concentrations (75 mM, 150 mM and 250 mM KCI, 
respectively). b, Binding of immobilized recombinant Strep-UDM2 or empty 
Strep-Tactin beads to indicated di-ubiquitin (Ub) linkages was analysed by 
immunoblotting. c, Biotinylated peptides corresponding to the LRM1 and 
LRM2 motifs in human RNF168 were analysed for binding to recombinant 
H2A or H1.0 in vitro by Streptavidin pull-down followed by SDS-PAGE and 


t 


*MIU1 (A179G) 


Colloidal Blue staining. The migration of molecular weight markers (kDa) is 
indicated on the left. d, Sequence of the UDM1 region in human RNF168, 
showing the location of the LRM1, UMI and MIU1 motifs. Acidic amino acids 
are highlighted in red. The sequence corresponding to the LRM1 peptide 

(c) and mutations introduced to generate UDM1 *UMI and *MIU1 (f) are 
indicated. e, f, Pull-down assays of Strep-tagged UDM1 and UDM2 constructs 
expressed in U2OS cells. a-c, e, f, Uncropped blots are shown in Supplementary 
Fig. 1. 
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expressed in U20S cells were immunoblotted (IB) with antibodies to indicated | overexpressed GFP-UDM1 (SILAC ratio (GFP-UDM1/mock) >1.5). Only 
histones. b, Localization pattern of GFP-tagged UDM1 expressed in U2OS one protein, histone H1x, was common to both of these subsets of cellular 
cells. Scale bar, 10 jim. c, Venn diagram showing overlap between proteins proteins. a, Uncropped blots are shown in Supplementary Fig. 1. 
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Extended Data Figure 10 | Impact of DSBs on H1 chromatin association. _resolubilized, EBC-insoluble fractions were incubated with recombinant K63- 
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cells left untreated or exposed to IR were lysed in EBC buffer. Soluble and 
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Anaerobic ammonium oxidation (anammox) has a major role in 
the Earth’s nitrogen cycle’ and is used in energy-efficient waste- 
water treatment®. This bacterial process combines nitrite and 
ammonium to form dinitrogen (N2) gas, and has been estimated 
to synthesize up to 50% of the dinitrogen gas emitted into our 
atmosphere from the oceans’. Strikingly, the anammox process 
relies on the highly unusual, extremely reactive intermediate 
hydrazine*, a compound also used as a rocket fuel because of its 
high reducing power. So far, the enzymatic mechanism by which 
hydrazine is synthesized is unknown. Here we report the 2.7A 
resolution crystal structure, as well as biophysical and spectro- 
scopic studies, of a hydrazine synthase multiprotein complex iso- 
lated from the anammox organism Kuenenia stuttgartiensis. The 
structure shows an elongated dimer of heterotrimers, each of which 
has two unique c-type haem-containing active sites, as well as an 
interaction point for a redox partner. Furthermore, a system of 
tunnels connects these active sites. The crystal structure implies a 
two-step mechanism for hydrazine synthesis: a three-electron 
reduction of nitric oxide to hydroxylamine at the active site of 
the y-subunit and its subsequent condensation with ammonia, 
yielding hydrazine in the active centre of the a-subunit. Our results 
provide the first, to our knowledge, detailed structural insight into 
the mechanism of biological hydrazine synthesis, which is of major 
significance for our understanding of the conversion of nitrogen- 
ous compounds in nature. 

Most nitrogen on earth occurs as gaseous N, (nitrogen oxidation 
number 0). To make nitrogen available for biochemical reactions, the 
inert Nz has to be converted to ammonia (oxidation number —III), 
which can then be assimilated to produce organic nitrogen com- 
pounds, or be oxidized to nitrite (oxidation number +IID) or nitrate 
(+V). The reduction of nitrite in turn results in the regeneration of N2, 
thus closing the biological nitrogen cycle. 

To produce N, from nitrite, a nitrogen-nitrogen bond must be 
formed by the addition of another nitrogen-containing molecule. At 
present, two biological processes are known that can achieve this. In 
denitrification, nitrite is first reduced to nitric oxide (NO, +I). Then, 
two molecules of NO are combined to produce nitrous oxide (NO, 
+1), which is subsequently reduced to N>. The other process, anaerobic 
ammonium oxidation or anammox'”’, was discovered only relatively 
recently, and relies on the combination of two compounds with differ- 
ent nitrogen oxidation states, nitrite and ammonium, to generate No. 

Our current understanding of the anammox reaction (equation (1)) 
is based on genomic, physiological and biochemical studies on the 
anammox bacterium K. stuttgartiensis*°. First, nitrite is reduced to 
nitric oxide (NO, equation (2)), which is then condensed with ammo- 
nium-derived ammonia (NH) to yield hydrazine (N2H4, equation (3)). 
Hydrazine itself is a highly unusual metabolic intermediate, as it is 
extremely reactive and therefore toxic, and has a very low redox poten- 


tial (Ep’ = —750 mV). In the final step in the anammox process, it is 
oxidized to No, yielding four electrons (equation (4)) that replenish 
those needed for nitrite reduction and hydrazine synthesis and are used 
to establish a proton-motive force across the membrane of the ana- 
mmox organelle, the anammoxosome, driving ATP synthesis (see ref. 6 
for a review). 


NO,~ +NHy*>N)+2H,0 AG”=—357kJ mol7’ (1) 
NO,” +2H* +e7 >-NO+H,0 £E)’=+0.38 V (2) 
NO+NH,* +2Ht +3e7 -N,)H,+H,O Ey’'=+0.06V (3) 


N,Hi>N,+4H* +4e7 Ey’ = —0.75 V (4) 


The enzyme producing hydrazine from NO and ammonium—hydra- 
zine synthase (HZS)—is biochemically unique. A complex of three 
proteins, HZS-a, -B and -y, encoded by the genes kuste2861, -2859 
and -2860, respectively, was put forward as the probable 
hydrazine synthase enzyme’. This complex was isolated from K. stutt- 
gartiensis cells and shown to be catalytically active in a coupled assay 
with the octahaem c-type cytochrome kustcl061 (ref. 7) to convert 
hydrazine into N, and return electrons to HZS*. Isolated HZS is a 
comparatively slow enzyme with an activity of 20 nmol h ’ mg ' 
protein, about 1% of in vivo turnover. This striking loss of activity 
occurs immediately upon cell lysis and might be explained by the 
disruption of a tightly coupled multicomponent system, as well as by 
the use of bovine cytochrome c as an artificial electron carrier in the 
in vitro assay*. 

Using a custom-designed crystal cooling method, we prepared well- 
diffracting crystals of the HZS-aBy multienzyme complex from 
K. stuttgartiensis and determined its crystal structure at 2.7 A resolu- 
tion in the absence of substrates (Fig. la and Extended Data Table 1). 
The structure reveals a crescent-shaped dimer of heterotrimers with an 
(aBy)2 stoichiometry. The overall size and shape of the complex were 
confirmed by analytical ultracentrifugation and solution small-angle 
X-ray scattering (Supplementary Information and Extended Data 
Fig. 1). Each heterotrimer contains four haems and one zinc ion, as 
well as several calcium ions (Fig. 1b, Supplementary Information and 
Extended Data Table 2). 

The a-subunit (Fig. 1c) consists of three domains: an N-terminal 
domain which includes a_ six-bladed f-propeller, a middle 
domain binding a pentacoordinated c-type haem (haem aI) and a 
C-terminal domain which harbours a bis-histidine-coordinated c-type 
haem (haem aII). The structure around haem aI (Fig. 1d) deviates 
substantially from a typical haem c site, as the canonical histidine of the 
haem c binding motif, «His587, is rotated away from the haem iron, 
and coordinates a zinc ion. Instead, the hydroxyl group of «Tyr591 
serves as the proximal ligand to the haem iron, as in the active site of 
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Figure 1 | Crystal structure of HZS. a, HZS complex structure; «-subunits 
are coloured green, B-subunits are blue and y-subunits are grey. 

b, Surface representation. The contact area between two heterotrimers 
(~1,350 A?) is made up of contributions from a- and B-subunits only. 
Considerable solvent-filled space remains between the heterotrimers. Calcium 
ions are labelled Ca, zinc as Zn. Edge-to-edge distances between the haems 
within a subunit are indicated in angstréms. c, Stereofigure of the #-subunit. 
The N-terminal domain (residues 028-420), middle domain (%421-«670) and 
C-terminal domain (671-808) are indicated in different shades of green. The 
two haem groups are shown as sticks. d, Stereofigure of the haem al site, 
overlaid with the simulated annealing composite omit map, contoured at 1.00. 
The zinc ion and its coordinating water are shown as grey and red spheres, 
respectively. The haem propionates are labelled Prp. 
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many catalases*. Importantly, this tyrosine is conserved in HZS-« 
sequences (Extended Data Fig. 2). The zinc bound to «His587 is 
further coordinated by one of the haem al propionate groups, as well 
as aCys303 and probably a water molecule, in a structure reminiscent of 
the active sites of alcohol dehydrogenase and various metalloproteases’. 
The zinc ion could play a structural role, assisting in rotating oHis587 
away from the iron, allowing «Tyr591 to bind, or could directly modu- 
late the chemistry of the haem group, with which it interacts via a 
propionate group. ¢Thr571, «Ala569 and aMet556 (which is partially 
oxidized, see Supplementary Information) are in close proximity to the 
distal side of haem aI, which does not seem to coordinate a solvent 
molecule in the crystal structure. In contrast, haem aI is bound by a 
canonical haem c binding motif and is coordinated by «His772 distally 
and oHis689 proximally. The edge-to-edge distance’® between haems 
ol and all is 31 A (Fig. 1b), which is too long for single-step electron 
transfer between the haem groups of the «-subunit. The edge-to-edge 
distances between the haem groups in the two different «-subunits in 
the complex are larger than 38 A, which excludes electron transfer 
between the two o-subunits on the timescale of catalysis. 

The non-haem f-subunit (Fig. 2a) is a seven-bladed B-propeller 
with a short helical insertion in the sixth propeller blade. The outer 
strand of the C-terminal blade consists of the N terminus (residues 
40-52) of the y-subunit of the same heterotrimer. Notably, the HZS 
B- and y-subunits are fused into a single polypeptide in the anammox 
bacteria Scalindua profunda and Scalindua brodae (ref. 11 and 
Extended Data Fig. 3). 

The structure of the y-subunit (Fig. 2b) is reminiscent of the fold of 
the homologous dihaem cytochrome c peroxidases (CCPs)'*"* and 
Paracoccus denitrificans methylamine utilization protein G (MauG)’* 
and consists of two o-helical lobes, each of which contains one c-type 
haem. Haem yI in the N-terminal lobe (Fig. 2c) is coordinated prox- 
imally by yHis106 and distally by a water molecule, and is covalently 
bound to yCys102 and yCys105 on a typical haem c binding motif. 
Intriguingly, the electron density maps clearly show a unique third 
covalent bond with the protein, between the C; porphyrin methyl 
group and the Sy sulfur atom of yCys165 (Extended Data Fig. 4a), 
which possibly serves to modulate haem chemistry. At the distal side, 
the iron binds a water molecule, which is hydrogen bonded to 
yAsp168. This conserved residue (Extended Data Fig. 3) is perfectly 
positioned to transfer protons to a ligand molecule coordinated to the 
haem. A structural superposition (Extended Data Fig. 4b) reveals that 
haem 71 is located at the position of the high-spin haem of the homo- 
logous Nitrosomonas europaea CCP* and P. denitrificans MauG”’. 

The bis-His-coordinated haem yII in the C-terminal lobe is located 
at the equivalent position as the electron transfer haem in CCPs and 
MauG (Extended Data Fig. 4b), at an edge-to-edge distance of 15 A 
from haem yI (Fig. 1b), which would allow direct electron transfer 
between the haems in the y-subunit. In CCPs and MauG, a conserved 
Trp residue is believed to be involved in catalytic redox chemistry. In 
HZS-y, the position of this tryptophan is taken up by yHis144. The 
y-subunit binds three calcium ions, one of them at the same position as 
the Ca-binding site in CCP that is essential for its activation. Moreover, 
haem II is located on the surface of the complex, exposed to the 
solvent, surrounded by a negatively charged patch, as in a cytochrome 
c binding site (Extended Data Fig. 5). Therefore, haem yII probably 
functions in electron transfer. 

Thus, it appears that the o.- and the y-subunit each contain an active 
site (haems aI and yI) and the y-subunit contains an electron-transfer 
site (haem yII). Electron paramagnetic resonance (EPR) spectroscopy 
(Extended Data Fig. 6 and Supplementary Information) is consistent 
with a stoichiometry of two bis-His-coordinated haems and two 
haems for which a population of ligation states exist. 

Intriguingly, our crystal structure revealed a tunnel connecting the 
haem al and YI sites (Fig. 3a). This tunnel branches off towards the 
surface of the protein approximately halfway between the haem sites, 
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Figure 2 | Structure of HZS-B and HZS-y. a, Structure of the B-subunit. The 
6245-260 insertion is shown in purple. The N terminus of the y-subunit, 
which engages in $-completion with the first blade of the B-propeller is shown 
in grey. b, Structure of the y-subunit. c, Stereofigure of haem I and its 
surroundings, overlayed with the simulated annealing composite omit map 
(blue, 1.0c). The water molecule bound to the haem iron is shown as a red 
sphere. The green mesh is the difference electron density calculated before 
inclusion of the water molecule in the model (5.0). 


making them accessible to substrates from the solvent. Indeed, binding 
studies show that haem al is accessible to xenon (Extended Data 
Fig. 4c). Interestingly, in-between the o- and y-subunits, the tunnel 
is approached by a 15-amino-acid-long loop of the B-subunit (B245- 
260), placing the conserved BG1u253, which binds a magnesium ion, 
into the tunnel. 

These observations allow a mechanism for biological hydrazine 
synthesis to be proposed (Fig. 3b). The presence of two active sites, 
connected by a tunnel, strongly suggests a mechanism with two half- 
reactions. HZS combines NO (nitrogen oxidation number +II) and 
NH, (N oxidation number —III). To reach the —II oxidation number 
of the nitrogen atoms in hydrazine, nitric oxide must be reduced. As 
proposed earlier’, this could happen in the y-subunit, resulting in the 
production of hydroxylamine (NH2OH; nitrogen oxidation 
number —I) according to equation (5). 


HI -I 
NO+3H*t +3e —+NH,OH (5) 


This three-electron reduction is consistent with the proposal that HZS 
obtains electrons from the trihaem cytochrome c kuste2854 (ref. 6). In 
this scheme, the electrons would enter HZS through haem II and be 
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Figure 3 | Proposed mechanism of biological hydrazine synthesis. a, Tunnel 
between the active site haems (orange, major tunnel) with the branch to the 
protein surface. The 8245-260 loop is shown in purple, as well as BGlu253 
which binds a magnesium ion (light-green sphere) and yHis144. A minor 
tunnel (lower right) leads to the zinc ion, and could allow ammonium to enter. 
b, Details of the proposed mechanism. NO travels to haem yI through the 
tunnel (orange) via the branch leading to the surface. On the left, three 
electrons enter the complex at haem yII and are conducted to haem yI via 
yHis144. Together with three protons reaching haem yI from the solvent via the 
buried polar cluster, the electrons reduce NO to NH,OH (grey box). NH,OH 
then diffuses through the tunnel, which is regulated by the B-subunit 
through the 8245-260 loop, and binds to haem al. There, it undergoes 
comproportionation with NH; to yield hydrazine (green box). c, Stereofigure, 
showing a model of hydroxylamine (HA) bound to haem aI in a very 
hydrophobic environment. 


transferred to the active site haem yI, possibly via yHis144. yAsp168 
could assist in adding the protons. A cluster of buried, polar residues 
(yAsp112, yArg143 and yArg167) is positioned between yAsp168 and 
the surface of the complex and could serve to transfer protons to the 
active centre of the y-subunit. 

In the proposed mechanism, hydroxylamine then diffuses through 
the tunnel to the «-subunit’s active site. Given the position of the 
6245-260 loop, the B-subunit could play a role in modulating trans- 
port through the tunnel. Hydroxylamine is isoelectronic with hydro- 
gen peroxide, and is a competitive catalase inhibitor’®. Thus, it would 
bind to the distal coordination site of the catalase-like haem al, 
which would polarize the N-O bond. As crystal soaking with 
NH2OH was unsuccessful, we constructed a model of this complex 
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(Fig. 3c) which shows that hydroxylamine would be bound in a tight, 
very hydrophobic pocket, so that there is little electrostatic shielding 
of the partial positive charge on the nitrogen. Ammonia produced 
from ammonium (the predominant form at pH=6.3 in the 
anammoxosome”’) could then perform a nucleophilic attack on the 
nitrogen of hydroxylamine, yielding hydrazine through compropor- 
tionation (equation (6)). 


I -IN —Il 
NH,0OH +NH;3 —>NoHy + H,0 (6) 


Hydrazine could leave the enzyme via the tunnel branch leading to the 
surface. 

Interestingly, the proposed scheme is analogous to the Raschig 
process used in industrial hydrazine synthesis. There, ammonia is 
oxidized to chloramine (NH,Cl, nitrogen oxidation number —I, like 
in hydroxylamine), which then undergoes comproportionation with 
another molecule of ammonia to yield hydrazine. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Protein purification. The kuste2859-2860-2861 hydrazine synthase (HZS) com- 
plex was purified from a planktonic K. stuttgartiensis culture as described prev- 
iously*. Briefly, cell-free extracts prepared from a ~95% single-cell enrichment 
culture of K. stuttgartiensis were subjected to ultracentrifugation (180,000g; 4 °C; 
1h) to pellet cell membranes. HZS present in the bright-red supernatant was 
brought to homogeneity by a two-step column purification procedure 
consisting of subsequent Q Sepharose XL (GE Healthcare) and CHT Ceramic 
Hydroxyapatite (Bio-Rad, USA) column chromatography steps. UV-Vis spectra 
of as-isolated HZS showed a Soret absorption peak at 406 nm and a broad band in 
the 530 nm region, which are typical for fully oxidized (ferric) haem c proteins. 
Reduction of the protein under anoxic conditions using sodium dithionite resulted 
in a shift of the Soret maximum to 420 nm as well as haem «- and B-bands at 
553nm and 523 nm, respectively. Protein concentrations used for ICP-MS and 
EPR measurements were determined using the Bradford assay (Bio-Rad, USA) 
with bovine serum albumin as standard. 

Analyses by MALDI-TOF and ESI-TOF mass spectrometry. The subunits of 
the HZS complex were separated by 15% sodium dodecylsulfate polyacrylamide 
gel electrophoresis (SDS-PAGE). To identify the individual subunits of the HZS 
complex, Coomassie-stained SDS gel slices were digested with trypsin or chymo- 
trypsin, followed by reduction with DTT and alkylation with iodoacetamide. The 
resulting peptides were purified and concentrated using a Millipore ZipTip C-18 
column, spotted onto solid targets with o-cyanocinnamic acid, and analysed 
by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry 
(MALDI-TOF MS) on an Axima TOF? Performance mass spectrometer 
(Shimadzu Biotech, Duisburg, Germany). Signal peptide cleavage sites were pre- 
dicted using the SignalP 3.0 Server'® applying Hidden-Markov models for Gram- 
negative bacteria. Liquid HZS samples were analysed by electrospray ionization 
time-of-flight mass spectrometry (ESI-TOF MS) on a maXis spectrometer 
(Bruker Daltonics) under denaturing conditions after diluting in 50% (v/v) acet- 
onitrile/0.1% formic acid and separation by reversed-phase high-performance 
liquid chromatography (RP-HPLC) using a Discovery BIO Wide Pore C5 column 
(20 X 2.1mm, 5 |tm particle size, Supelco) at a flow rate of 50 ul min™?. 

Metal analysis by inductively-coupled plasma mass spectrometry (ICP-MS). 
Metals were analysed by ICP-MS ona Series I ICP MS (Thermo Scientific, Breda, 
the Netherlands). Height point calibration was performed with a dilution series 
of (multi-) element standards (1,000 p.p.b. in 1% nitric acid; Merck, Darmstadt, 
Germany) using the PlasmaLab software (Thermo Scientific, Breda, the 
Netherlands). To determine the metal content of HZS, 70-300 ul of purified 
HZS (26 mg protein per ml) was washed with water using a Vivaspin 500 filter 
(Sartorius, Gottingen, Germany), destructed with 10% nitric acid at 90 °C for 90 
min and diluted to 10 ml with water. 

Analytical ultracentrifugation (AUC). Protein samples were concentrated in 
25 mM Hepes/KOH, pH 7.5, 25mM KCI to Ajs™ ~ 0.3 and Aj” ~ 0.45, corres- 
ponding to 0.3 mg ml’, as determined using the Bradford protein assay from Bio- 
Rad. Sedimentation velocity analytical ultracentrifugation was performed in a 
Beckman ProteomeLab XL-I (Beckmann Coulter, Krefeld, Germany) analytical 
ultracentrifuge equipped with an An60Ti rotor at 30,000 r.p.m. and 20 °C ina two- 
sector cell with a 1.2 cm optical path length. Absorption scan data were collected at 
280 nm and 406 nm and evaluated using SEDFIT”’. 

Protein crystallization and crystal treatment. Hydrazine synthase was concen- 
trated to 45mg ml! in 25 mM HEPES/KOH pH 7.5, 25 mM KCI by ultrafiltra- 
tion, divided into 50-ul aliquots, frozen in liquid nitrogen and stored at —80 °C. 
Prior to crystallization, the protein stock was diluted to 30 mg ml’ with 25 mM 
HEPES/KOH pH 7.5, 25 mM KCl. Crystallization was performed in 1 pl + 1 ul 
sitting drop vapour-diffusion setups at 8 °C, equilibrating against 500 pl 36% (v/v) 
1,4-dioxane. Dark red, rhombohedral crystals with dimensions up to 
400 x 400 x 100 uM grew within three days. The addition of 1mM 5-amino- 
2,4,6-triiodoisophthalic acid adjusted to pH 7 with ethanolamine to the precipitant 
solution in the drops yielded more crystals and accelerated crystal growth. Using 
PEG 400 or other conventional cryoprotectants, diffraction of these crystals suf- 
fered from diffuse scattering, limiting resolution to approx. 4A and precluding 
SAD phasing. Successful cryoprotection was carried out by soaking the crystals for 
10-30 s in 4 M betaine (N,N,N-trimethylglycine) in 50% (v/v) methanol at 8 °C, 
before flash-cooling in liquid nitrogen. These crystals showed sharp Bragg spots, 
were used for phasing and to build the initial model. However, as the crystals 
dissolved in the soaking solution at 8 °C, crystals were slowly cooled to —20 °C on 
a custom-designed Peltier-cooled microscope stage, which will be described in 
detail elsewhere. After incubation in the soaking solution at this temperature for 
up to 30min, crystals were flash-cooled in liquid propane at a temperature of 
approximately 150K. These crystals diffracted up to 2.7A resolution. Xenon 
treatment was performed in a —20°C room by transferring the crystals cooled 
to —20°C into a Xe-pressure cell (Xcell, Oxford Cryosystems Ltd, Long 


Hanborough, UK) and incubating for 5 min at —20°C and a xenon pressure of 
20 bar before freezing in liquid propane. 

X-ray data collection, structure solution and analysis. Diffraction data were 
collected at beam line X10SA of the Swiss Light Source (Paul Scherrer Institute, 
Villigen, Switzerland) at 100 K and processed with XDS”°. A highly redundant 
single-wavelength anomalous dispersion (SAD) data set at a resolution of 3.7 A 
was collected just above the iron K-edge at a wavelength of 1.735 A (see Extended 
Data Table 1) which was used for phase determination with AutoSHARP”'. 
SHELXD” identified 5 heavy atom sites (CC(E) = 0.24), which were used by 
SHARP for phasing, resulting in a figure-of-merit of 0.22. Density modification 
with SOLOMON?” resulted in a readily interpretable map, into which the struc- 
tures of all three subunits could be built using Coot”*. Phase extension using a data 
set of 3.1 A collected at 0.9763 A wavelength was carried out with DM”. Further 
refinement against a 2.7A data set collected at 1.0000A wavelength using 
PHENIX” and REFMAC” resulted in a model with good geometry and 
R-factors (96.4% of residues in favoured regions of the Ramachandran plot, 
0.07% Ramachandran outliers, see Extended Data Table 1) and revealed that 
two loop regions in the «-subunit («175-177 and «643-650) were no longer 
ordered, despite the increase in overall resolution. In order to confirm the identity 
of the metal sites, data sets above and below the K-edges of iron, copper and zinc 
were collected (see Supplementary Information and Extended Data Table 2). All 
other data-processing procedures were performed with programs of the CCP4 
suite**. Tunnels were identified using MOLE 2.0 (ref. 29) using standard parameter 
settings starting from wThr571, «Tyr591 and yAsp168. Structural figures were 
prepared using Pymol (Schrodinger). The model of the hydroxylamine complex 
was prepared by manual docking in COOT™, using an iron-nitrogen bond length 
between haem aI and hydroxylamine as observed in crystal structures of catalase- 
hydroxylamine complexes. 

Small-angle X-ray scattering (SAXS). Hydrazine synthase was concentrated to 
45 mg ml! in25mM HEPES/KOH pH7.5,25 mM KCL. SAX data were measured 
in 1-mm diameter quartz capillaries at the X12SA beam line (cSAXS) of the Swiss 
Light Source (Paul Scherrer Institute, Villigen, Switzerland) at 283 K. The X-ray 
photon energy was 12.4 keV, and 200 measurements of 0.5 s each were recorded 
over 10 positions along the length of the capillary, which was mounted at a detector 
distance of 2.138 m. Background measurements with the buffer only were taken 
using the identical capillaries, positions and measurement protocol. Data were 
used to a maximum momentum transfer of 0.4A7!. Data analysis and three- 
dimensional reconstruction were performed using the GNOM” and GASBOR*! 
programs from the ATSAS suite. 

EPR spectroscopy. EPR spectroscopy was performed with a Varian E-9 spectro- 
meter operating at X-band (microwave frequency 9.188 GHz; modulation ampli- 
tude, 1.0 mT) equipped with a home-made He-flow cryostat at 12 K. HZS samples 
were used as isolated at a concentration of 205 1M, filled into quartz tubes and 
shock frozen in liquid nitrogen before the measurements. Samples in the presence 
of 200 1M NH,OH or 200 HM NO plus 200 4M NH,* were prepared and ana- 
lysed in the same way. 
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Extended Data Figure 1 | Small-angle X-ray Scattering (SAXS) results. that the protein is not aggregated. c, Kratky plot (plot of q’I versus q) showing 
a, Semilogarithmic plot of scattered intensity I versus q, which was defined as __ that the protein is folded. d, Average of 18 (out of 20) dummy-atom 
q= (4m sin 9)/A. The curve is an average over 200 measurements. Features are _ reconstructions (beige) overlayed on the crystal structure (black). 
observed up to q=0.4A |. b, Guinier plot (plot of log I versus q”) showing 
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Signal Peptide 


Kuenenia_stuttgartiensis 
Jettenia_caeni 
Brocadia_sinica 
Scalindua_brodae 


ER V Mic Cleigielc K ARawayD YEIGMEK EMO GHVB) 
lv Mpdc cAI K ABaWIED YBIGMENK EBIO Clg V emt 


Bee 


ind V MENG GHaigie)G K ABNWpND YEIGMISK Ego GLaV ia 
Geese) vis cfaivelc x aM Way YEGMBK EWOGHviies 
i vr 


-blade 1 
BI il i i 62 B3 B4 
» fandom coil insertion : ; ‘ 
Kuenenia_stuttgartiensis 60 ogy PLY lie SR I Vigan L KVL Tgp FT Bia w D lesen Fa 
Jettenia_caeni 61 pen L KVL TID FEMWAF DP ChabaW Dieia Kiam 
Brocadia_sinica 60 bese) L KV L TID FRRSIAF DP CiabaW DieiaK igi 
Scalindua_brodae 60 Hiei apse) L KV L THAD FRA F DP CHERIW Dimi KES 
B-blade 2 B-hairpin insertion 
BS £6 87 B8 i) 
—- 


Cl ee a 
YYAAG 


Kuenenia_stuttgartiensis 120 iT DIAG Ther 


FAG AKBIGGGCOTWERINEID GRicpRO era BIEEcRGRIg 
Jettenia_caeni BPSMEr ACH KINGGGCOIWEMINEID CecMRONTDpaacTRRMDAy YAACHIEEGMGRETM 
Brocadia_sinica 120 IG GG CQ I WEMINWD GR GEER ONT DBBNG TIBRE}P BAY YAAGHIEEGRGRYIW 
Scalindua_brodae SPL a GMHKINGGGCOIWEMNEED CHIGINROMT DIG TRIRED My YAAGMIEEGCRGRITE 
insertion 
7 pi2 a2 BI3 
Bbledes he  lannconos => 
Kuenenia_stuttgartiensis 180 GRIDERISIN[ERYR 1 DRIOGGHIMameR Thy 
Jettenia_caeni 181 SER IBINDRR LOG GCHIggsOR Thy 
Brocadia_sinica 180 Faiueeanea bag RT DOG CENMeaaR Thy 
Scalindua_brodae 180 ete MaNeRYR L DBOGCHEMEIR Its 


ae, nL LS ELC fblade aL pag 
stuttgartiensis 240 S a cea 
Brocadia_sinica 240 BaaaivDNWDGcRyPR| 


Scalindua_brodae 240 


19 20 " 21 
Bi B B-blade 5B: 
Kuenenia_stuttgartiensis 300 WSCELagly s wie 

caeni 301 CCR LABS S Winky 
Brocadia sinica 300 CMGRIL ARV s Wink 
Sealindua_brodae 300 EICRIGHIL AI Ss Whaey 

C303 
f-blade 6 
23 p24 25 £26 


——— —— —- —p 
Kuenenia_stuttgartiensis 360 DcRRRE Oy RRaR RM i py Re pcigepeynin r sxc alc gqy BNE 
Jettenia_caeni 361 Ny 


sinica Tg) Cle L YRS PMP LP DiRUSAV SHABRED FGI 
Scalindua_brodae 357 3] IV SEIAEIRIBID FG TY] 
f-sheet 
p28 p29 p30 p31 


—- eo>rsh _——— 
KP RWINGF TAGREUF GV TIJV TY OPFDOVAIVEGYPHSWETMICEDTT Lime 
KP RWINWF TAGIOUF GV Ty 


Kuenenia_stuttgartiensis 420 


Jettenia_caeni 421 iVTYOPFDOVVEGYPHSWeTQICFDTTLaeRs 
Brocadia_sinica PUL MEK PD RWINWF TAGESUF GV TRIVT ¥ OP FDOVAVEGY PHSWeTMI CFDTT Lie 
Scalindua_brodae PERK D RWINGIF TAGESAF GVTMVTY OP FDOVINVEGYPHSWEITWICFDTT LaemaaGP Y Pag 


633 
—_ ———“- 
Kuenenia_stuttgartiensis 480 eaten = "Wen FG RyCRCHHLLGCRaSSSNSGTRFBRIORE 
Jettenia_caeni 481 eadeny = > eR a CHICHNHLLGGCBSSSNSGTRFMOR 
Brocadia_sinica 480 ereameleay = PRGA GHLLGGBEs SSNSGTBF ORG 
Scalindua_brodae 477 PSS E PSG MAICECMHLLCGHas SSNSCTEF BORE 


£34 35 36 £37 
—— —_—_— ——_ 
Kuenenia_stuttgartiensis 540 SPIT TROL sve yy mehr pap rs gaya pa Ny as ero BER SRA ir 
Jettenia_caeni 541 Re) paid «Pape ONL DIC MEWV OMEN walla P YEIGR I CSGCHINGSWENEREgaING 
Brocadia_sinica EXC c SIV ys ORB) Baldy Pad OBL. DRE cME\V OBE EwaWidnp YEicR 1 CSGCHINc SHEN R ENGNG) 
Scalindua_brodae EY BMBOD G SWIVEIS HEED IY (VOUT DENG MV OIL EWAGIUR P YEGR I CTGCHIGS NRIMCMIaNEE 


* 
Heme al Y591 


Extended Data Figure 2 | HZS-a. sequences. The HZS-« sequences from 
Kuenenia stuttgartiensis (kuste2861, gi 91200564), Jettenia caeni 
(Planctomycete KSU-1, ksu1d0439, tr A9ZRZ5), Brocadia sinica JPN1 
(brosiA2676, gi 762182098) and Scalindua brodae (scabro01598, gi 726045835, 
re-confirmed by Sanger sequencing) were aligned in ClustalW and secondary 
structure elements were manually assigned based on the structure of 
Kuenenia HZS-a. Kuste2861 shares 81% sequence identity with its Jettenia and 


B-hairpin 


—>= 


Kuenenia_stuttgartiensis 600 Eig D [baci Y,aGVKHGiAD VVVPRBDpy YGcWscrTT Sia 
Jettenia_caeni 601 ig DigeiGia vlc VKHGBDVVVPEDMy yccyscrT sii 
Brocadia_sinica 600 Espa Dain] Gy Y (eG VK GID VV VV PRDIMY YGGISSGTT Si 
Scalindua_brodae 597 Gay Dish YANG VKH Gi VV VPRIDIWY YGGRjSGTT SAi 


Kuenenia stuttgartiensis 660 eng] 
661 ASN 
660 Anny} 

Scalindua_brodae 657 IST 


a6 


209 
Kuenenia_stuttgartiensis 720 ARREK IFAT BERL LENS pop 
Jettenia_caeni 721 OF EES KVERV EAU ME IDX BOD 
Brocadia_sinica 720 OFAQRENVERT EBA Mi IDI FpaggOD 
Scalindua_brodae 717 FARRANP MAR NIN TPP 
. 
H772 


Kuenenia_stuttgartiensis 780 BESTA (ppueay! EEMm yey? Gerd peep] 
Jettenia_caeni CE SME R YpgVEWIDEGAQWDNIfGPDgyPcy ime 
Brocadia_sinica 780 
Scalindua_brodae 777 


Brocadia orthologues and 61% with Scalindua. Fully conserved peptide 
sequences are marked black. The predicted signal peptides are highlighted in 
grey. The following residues are marked (numbering according to kuste2861): 
Cys303 coordinating Zn?* (blue triangle), Tyr591 coordinating haem ol 
(pink asterisk), distal His772 of haem aI (green circle). The c-type haem 
binding motifs are highlighted in red. The figure was prepared using ESPript. 


©2015 Macmillan Publishers Limited. All rights reserved 


Kuenenia_stuttgartiensis 
Jettenia_caeni 


Scalindua_brodae 


Kueneni: 
Jetteni: 


stuttgartiensis 
caeni 


Scalindua_brodae 


Kuenenia_stuttgartiensis 
Jettenia_caeni, 
Brocadia_sinica 
Scalindua_brodae 


Scalindua_brodae 


Kuenenia_stuttgartiensis 
Jettenii i 


Scalindua_brodae 


Kuenenia_stuttgartiensis 
Jettenia_caeni 
Brocadia_sinica 
Scalindua_brodae 


Kuenenia_stuttgartiensis 
Jettenia_caeni 

Brocadia sinica 
Scalindua_brodae 


Scalindua_brodae 


Kuenenia_stuttgartiensis 


Scalindua_brodae 


Extended Data Figure 3 | Sequence alignments of HZS-B and HZS-y. The 


Signal Peptide 


AcHTIOGTRVENT 
\cRI oc TEV 
cat ocTEVE 


BENS \ciiioctHvl 


Ba S[gp CEaga Y VEC SCR WNEBID IY 
Ba Sfp Cha ¥ VINC S GRRIWNEMyD Wy : 
Ba SiaD Glgua Y VINC S GRBAWNewaDIQT D Tle 
lisiap Geeaa ¥ viec s GlauW Nema Dea 


B16 p17 
—— —s 


EISEN LRGMgyY TPDGH 

EAR T ReASEN L Riewayy TP DG 

T WIfeaseNL ly TPDGw 
init bae) SiN L RlemeayY TP DCH 


B-blade 5 
pis HZSB loop p19 B20 p21 


— 


IGGK VAGEEP L DBBANNYDGNP Y GREP cha 
IccKVAESaP LD DGN. -AMDIGOKR 
ICCKVABMMP LDEBANNYDGN. LS AMDERR 
ICCK VABBEP L DRINNY DGNP Y Cilmi? Chg 


p22 p23 al 2 03 ppd pas 
: ggo0c9o02 9000p ee 


BY OWE L DpgaEeeD Let gD LVARVE 
SORE LD BageegeD Lig LgDYLVARVY 
BORE L DpateseeD LL We Y LVARVENGLGP 


GEBUOWIE L DISUR@UD LByLINeED YL VARVIGMIGLGD SS Vig 


> a i 
LSP DGKW@YRANYFS 

LL sPDGKEgRYAANYF s 

Ls PDGKMYRIANY F SN 

IL SPDGK PMY ANY FS 

start of HZSy 

lS PK SELVRYGOF@FN OCR YECASCHYERGOTTGCMIWDLGDEGWGSWKNBKYI 
lSPMKSELVRYGOFMPNE Oc yACASCHYERGOTTGMIWDLGDEGWGSWKNBKYI 
lH SPIgK SELVRYGOFMP NE) OGY RCASCHYERGOTT GI WDLGDEGWGSWKNKYI 
las PK SELVRYGOFMFNE OGRIYBICASCHYERGOTTGHlI WDLGDEGWGSWKNBKYI| 


93 


gooonoo09 2 
IRGGRY LP PF RHEGF TGHPDEIVGABMS SPDRVCGRDPGFVFRSENF SPRLEPMI (BY IRL 
IRGGRYLDPFRHEGFTGHPDEIVGAWSSWDRVCGRDPGFVFRSENF SPRRLER MT My TRE 
IRGGRY LP PF RHEGF TGHPDEIVGAMS SMDRVCGRDPGFVFRSENF SPi@RLERM fay IRIL 
IRGGRY LP PF RHEGF TGHPDEIVGANS SMD RVCGRDPGFVFRSENF SPRLEEWMIBY IROL 
153 ar Wa 193 

C165 D168 

as 06 
299000909000 20000 


EF TGSPFRERIDGS LTEAGKRGMKEF ED PRVGCLECHP GiiaunlPRALWSDAQTHDVGTGRY| 
EF TGSPFREBIDGS LTEABKRGKIEF IUD Piav GCVECHP CBee WA LSDAQTHDVGTGRY| 
EF TGSPFRMEIDGS LTEARKRGBKUAF ID PAV GCAECHP CRpNtIPWALWSDAOQTHDVGTGRY| 
EF TCSPFRMSADCSLTEABMKRGKBF ED PV CCMECHD ClfsaNP A LWSDAQTHDVCTGRY| 


213 Heme yll 283 253 


HZS-By fusion protein from Scalindua brodae (scabro01046, gi 726046454, 


re-confirmed by Sanger sequencing) was aligned using ClustalW to the fused 


sequences of K. stuttgartiensis HZS-B (kuste2859, gi 91200562) and HZS-y 


(kuste2860, gi 91200563, lacking its predicted signal peptide) as well as to the 
fused sequences of Jettenia caeni (Planctomycete KSU-1) HZS-B (ksu1d0441, tr 


I3IPV5) and HZS-y (ksu1d0440, tr A9ZRZ4, lacking its predicted signal 


peptide) and the fused sequences of Brocadia sinica JPN1 HZS-B (brosiA2674, 


gi 762182096) and HZS-y (brosiA2675, gi 762182097, lacking its predicted 


signal peptide). Secondary structure elements were manually assigned based on 
the structures of K. stuttgartiensis HZS-B and HZS-7. The first residue of HZS-y 


in the HZS By fusions is indicated by a pink asterisk (starting at residue 
number 40 of K. stuttgartiensis HZS-y, the numbers under the Scalindua 
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ip LIGLDLVKEFDTPTLRDIYASGTYFHDG| 


ALBA GBD P YG} 


Kuenenia_stuttgartiensis 601 v 
A LBAGHSD P ¥ ca 


Jettenia_caeni 592 lp LIGLDLVKEFDTPTLRDIYASGTYFHDG| 
Brocadia_sinica 591 JALMIAGED P cha ip LIGLDLVKEFDTPTLRDIYASGTYFHDG| 
Scalindua_brodae 591 BP IIGLDLVKEFDTPTLRDIYASGTYFHDG| 


313, 


Kuenenia_stuttgartiensis 661 
Jettenia_caeni 652 
Brocadia_sinica 651 
Scalindua_brodae 651 


ART LIT 1 
BAbyT Liege Ih 
BART Lear 

ABT Livan T 1 


sequence indicate the numbering in kuste2860). The Kuste2859-60 fusion 
shares 83% sequence identity with its J. caeni and B. sinica orthologues and 72% 
with S. brodae. Fully conserved peptide sequences are marked in black. The 
predicted signal peptides of the B-subunits are highlighted in grey. The 
following residues are marked (numbering according to kuste2859 and 
kuste2860): Glu253 in HZS-B (pink triangle), Cys165 covalently bound to 
haem yI, Asp168 near the haem yI catalytic site (blue triangle) and the distal 
His332 of haem yII (green circle). The c-type haem binding motifs are 
highlighted in red. The figure was prepared using ESPript. The predicted signal 
peptides of the y-subunits not included in the alignment are: kuste2860: 
MAREMRLGGKERMKTGVVKIGLVAALGVVGLISAGGVYA_GQP...; 
ksuld0440: MRNGMIKIGLVAALGIAGVVTAGEIMA_ GTP... ;brosiA2675: 
MKSSLKIGLIAALGIAGVMTTGELMA_ GTP. 
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PY 


c aMet556 aMet556 


oAlaS73 aAla573 
ep ) 
aThr571 Xe oThr571 Xe 


aTyr591 aTyr591 


Extended Data Figure 4| Details of HZS structure. a, Covalent attachment _haems yI and yII correspond to those of the haems in CCP and MauG (sticks), 
of haem I via three cysteine sulfur atoms. The simulated annealing 2mF,—DF. as does the position of a calcium ion (spheres). The conserved tryptophan 
composite omit map is shown contoured at 1o, overlaid on the final, refined residue proposed to be involved in redox catalysis in MauG and CCP 
structure. yCys102 and yCys105 are part of the canonical CXXCH motif corresponds to His144 in HZS-y (sticks). c, Xenon binding shows that haem 
(grey cartoon). In addition, there is a covalent bond between the Sy atom of al is accessible from the solvent. The Xe atom is shown as a sphere. Green 
yCys165 and the C; porphyrin methyl group of haem yI. b, Overlay of HZS y | mesh: mF,— DF, map calculated before inclusion of Xe in the model, (100). 
(grey) with N. europaea CCP (PDB entry 11QC, light brown) and P. aMet556 has assumed a new conformation. 

denitrificans MauG (PDB entry 3L4M, dark brown). The positions of 
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LETTER 


Extended Data Figure 5 | Electrostatic surface properties of the HZS surrounding haem yII. A prominent negatively charged patch surrounds the 
complex. Haem moieties are shown as sticks. a, Overview of the whole HZS__haem as in cytochrome c binding sites. c, Magnified view of the vacuum 
structure. The bis-His-coordinated haem yII is indicated with a black circle. electrostatic properties of the surface surrounding haem aI. No significant 
Haem all is obscured in this view but its position is indicated by a black differences with the rest of the protein surface are observed. Figure prepared 
arrow. b, Magnified view of the electrostatic properties of the surface using UCSF Chimera”. 
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LETTER 
a 


b 
Unknown 
Unk 
VE HSp2 _ HSp HALSI — (51,2,3 
HALS2 Hl if \ 
All hemes 
7 As isolated \ Hee os 
As isolated 
+NH,OH A_A_ Ls1 
Ly 2 
ri V 
+NO/NH, AA LS3 
HALS1 
2 & HSp1,2 HALS2 
T T T T T T 1 
800 1200 1600 1000 2000 3000 4000 5000 800 1200 1600 2000 3000 4000 5000 
Magnetic Field (Gauss) Magnetic Field (Gauss) Magnetic Field (Gauss) Magnetic Field (Gauss) 
c 
Species 9x Gy gz Amount per ay unit 
LS1 1.739 2.263 2.66 0.16 
LS2 1.81 2.277 2.575 0.24 
LS3 1.885 2.304 2.48 0.08 
HALS1 1.41 2.057 3.127 0.95 
HALS2 nd nd 3.477 1.21 
Total low-spin 
2.57 + 0.39 
HSp1 6.24 5.641 1.992 1.59 
HSp2 5.90 5.90 1.99 0.41 
Total high-spin 
2.00 + 0.30 
Unknown g = 6.95 — 7.50 <0.2 
d e 
yHis229 yHis229 aHis772 aHis772 


heme yll heme yll 


yHis332 yHis332 


Extended Data Figure 6 | EPR spectroscopy of HZS. a, EPR spectra of HZS as 
isolated (black traces) and after addition of 200 4M NH,OH (blue traces) or 
200 1M NO plus 200 uM NH,” (red traces). The left panel shows the low 
magnetic field region highlighting the high-spin haem g, and g, resonances. 
The right panel shows the complete magnetic field scan where the intensity of 
the high-spin haem signals has run off-scale. Arrows indicate the positions 
of the various species that are listed in Extended Data Fig. 6c. The signal at 1540 
Gauss is due to a small amount (<0.2% per «Py unit) of adventitious iron. 
b, Simulation of the EPR spectra of HZS as isolated using the g values listed in 
Extended Data Fig. 6c . The difference between the simulation of HSp1 and 
the experimental spectrum defines the signal of HSp2 and its g value and 
suggests an amount of 0.41 per «Py unit (see panel c). c, HZS haem content 


heme all heme all 


aHis689 


aHis689 


per «By unit determined by EPR. The total haem content determined by EPR 
was 0.92 + 0.15 of the optically determined amount. nd, not detectable; LS, low- 
spin; HALS, highly anisotropic low-spin; HSp, rhombic high-spin peak. 

d, Stereofigure of the coordination of haem yII by yHis229 and yHis332. The 
perpendicular orientation of the histidine imidazole rings, both oriented 
towards haem meso atoms, is consistent with the g-values for HALS2. 

e, Stereofigure of the coordination of haem aI by xHis689 and wHis772. 
The orientation of the histidine imidazole groups, one (4His772) oriented 
towards a haem nitrogen atom and the other («#His689) towards a haem 
meso atom is consistent with the g values for HALS1 (see Supplementary 
Information). 
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Data collection’ 
Space group 
Cell dimensions 

a, b, c (A) 

a, By (°) 
Resolution (A) 
Rmerge 
Iiol 
Completeness (%) 
Redundancy 


Refinement 

Resolution (A) 

No. reflections 

Rwork! Riree 

No. atoms 
Protein 
Ligand/ion 


Water 

B-factors (A?) 
Protein 
Ligand/ion 
Water 

R.m.s deviations 
Bond lengths (A) 
Bond angles (°) 


*Each data set was collected from a single crystal. 
+Highest resolution shell is shown in parentheses. 


SAD 


R32 


461.8, 461.8, 145.1 
90, 90, 120 

40-3.4 (3.5-3.4) 
0.103 (0.530) 

17.8 (3.2) 

99.9 (100) 

11.7 (8.1) 


Extended Data Table 1 | Data collection and refinement statistics 


Initial Model 


R32 


464.0, 464.0, 145.0 
90, 90, 120 
40-3.1(3.2-3.1) 
0.100 (0.498) 

16.4 (3.7) 

99.8 (99.8) 

5.7 (5.6) 


Native structure 
(pdb 5C2V) 


R32 


464.5, 464.5, 145.8 
90, 90, 120 

40-2.7 (2.8-2.7) 
0.096 (0.738) 

18.5 (3.5) 

99.8 (100) 

8.8 (9.1) 


40-2.7 
162,788 
0.235 / 0.271 


22,420 

344 (8 heme) 

18 (12 Ca, 2 Zn, 2 Mg, 2 Cl) 
56 (7 betaines) 


500 

56.9 
58.3 
53.4 


0.009 
1.2 
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Xenon complex 
(pdb 5C2W) 


R32 


464.1, 464.1, 145.1 
90, 90, 120 
48.5-3.2 (3.3-3.2) 
0.138 (0.641) 

16.6 (4.6) 

100.0 (100.0) 

10.6 (10.8) 


48.5-3.2 
97,821 
0.231 / 0.267 


22,420 

344 (8 heme) 

18 (12 Ca, 2 Zn, 2 Mg, 2 
Cl) 

56 (7 betaines), 4 Xe 
498 


71.0 
61.4 
57.4 


0.009 
1.3 
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Extended Data Table 2 


a 


Data set* 


Identification of metals in hydrazine synthase 


Above Fe-edge 


Above Cu-edge 


Space group 

Unit cell dimensions 
a, b, c (A) 

a, By (°) 

Wavelength (A) 
Resolution range (A) * 
Reflections measured 
Reflections unique 
Completeness (%) 
Redundancy N 

Vol 

Rmerge (%) 


Data set 


R32 


467.2, 467.2, 146.0 
90, 90, 120 
1.73400 

30.0-3.80 (3.9-3.8) 
1,165,022 (88,940) 
116,983 (8,703) 
99.9 (99.8) 

10.0 (10.2) 

14.4 (4.7) 

15.5 (59.7) 


Below Fe-edge 
R32 


465.3, 465.3, 145.5 
90, 90, 120 
1.74600 

30.0-3.50 (3.6-3.5) 
1,469,568 (107,411) 
147,985 (12,027) 
99.9 (100) 

9.9 (8.9) 

18.6 (5.2) 

12.6 (50.1) 


Above Zn-edge 


R32 


465.3, 465.3, 145.5 
90, 90, 120 

1.37800 

30.0-3.20 (3.3-3.2) 
1,995,467 (172,227) 
193,705 (17,117) 
100 (100) 

10.3 (10.1) 

22.2 (6.5) 

9.5 (43.6) 


Below Zn-edge 


Below Cu-edge 


Space group 

Unit cell dimensions 
a, b,c (A) 

a, B, y (°) 

Wavelength (A) 
Resolution range (A) * 
Reflections measured 
Reflections unique 
Completeness (%) 
Redundancy N 

Vol 

Rmerge (%) 


b 


R32 


466.0, 466.0, 146.4 
90, 90, 120 

1.38500 

30.0-3.40 (3.5-3.4) 
1,669,421 (143,426) 


R32 


465.0, 465.0, 144.6 
90, 90, 120 
1.28100 

40.0-3.50 (3.6-3.5) 
1,465,592 (111,737) 
147,007 (11,917) 


R32 


465.9, 465.9, 145.0 
90, 90, 120 
1.29433 

40.0-3.80 (3.9-3.8) 
1,134,042 (79,076) 
115,562 (8,699) 


Element/position wrt. 


absorption edge 
Wavelength (A) 


1.29433 1.281 


Energy (eV) 
Energy difference 
from K-edge (eV) 
HZS a 

Caal 

Caall 

Zn 

Fe Heme al 

Fe Heme all 
HZS B 

Ca Bl 

HZS y 

Ca yl 

Ca yll 

Ca vill 

Fe Heme yl 


162,603 (13,535) 
100 (100) 

10.3 (10.6) 

17.6 (5.6) 

13.1 (55.4) 

Fe/below Fe/above 
1.746 1.734 
7101 7150 

-11 38 
8.1/1.0 6.5/1.0 
8.7/1.0 4.2/0.6 
8.5/1.0 5.7/0.9 
4.4/0.5 20.3/3.1 
3.8/0.5 18.8/2.9 
8.9/1.1 6.6/1.0 
9.9/1.2 6./0.90 
8.5/1.0 8.7/1.3 
12.2/1.5 8.3/1.3 
5.9/0.7 ip ol2.7. 
5.5/0.7 16.2/2.5 


99.9 (100) 
10.0 (9.4) 
15.4 (3.8) 
13.3 (68.3) 

Culbelow Culabove 
1.385 1.378 
8952 8997 

-27 18 
6.7/1.0 8.8/1.0 
5.6/0.8 7.7/0.8 
4.2/0.6 7.2/0.8 
20.5/3.1 27.4/3.1 
18.5/2.8 23.5/2.7 
5.8/0.9 7.9/0.9 
8.4/1.3 11.2/1.3 
6.8/1.0 8.1/0.9 
6.8/1.0 11.2/1.3 
20.4/3.0 27.0/3.1 
17.6/2.6 23.0/2.6 


99.9 (100) 

9.8 (9.1) 

16.2 (4.6) 

13.2 (64.1) 

Zn/below Zn/labove 
9579 9679 
-80 20 

6.0/1.0 4.7/1.0 
4.7/0.8 3.4/0.7 
4.5/0.8 24.3/5.2 
17.4/2.9 16.3/3.5 
15.3/2.6 12.6/2.7 
4.5/0.8 4.3/0.9 
6571.1 4.6/1.0 
5.5/0.9 4.6/1.0 
TAZ 4.6/1.0 
14.0/2.3 13.8/2.9 
13;3/2°2 12.9/2.7 


Fe Heme yll 


a, Data collection statistics for the anomalous diffraction data sets used to identify metal sites in the HZS crystal structure, calculated while considering Friedel mates as individual reflections. *Each data set was 
collected from a single crystal, {Highest resolution shell is shown in parentheses. b, Heights of peaks in anomalous difference density maps used to identify metal ions. The first number is the observed peak height 
in o, the second is the peak height normalized by the height of the anomalous peak at calcium al for each data set. Those sites that show a significant difference in normalized peak height below and above an 
absorption edge are shown in grey. The data confirm the identity of the zinc ion bound to haem al. 
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CORRIGENDUM 
doi:10.1038/nature15716 


Corrigendum: Whole- genome 
characterization of chemoresistant 


ovarian cancer 


Ann-Marie Patch, Elizabeth L. Christie, 

Dariush Etemadmoghadam, Dale W. Garsed, Joshy George, 
Sian Fereday, Katia Nones, Prue Cowin, Kathryn Alsop, 

Peter J. Bailey, Karin S. Kassahn, Felicity Newell, 

Michael C. J. Quinn, Stephen Kazakoff, Kelly Quek, 

Charlotte Wilhelm-Benartzi, Ed Curry, Huei San Leong, 

The Australian Ovarian Cancer Study Group, Anne Hamilton, 
Linda Mileshkin, George Au- Yeung, Catherine Kennedy, 
Jillian Hung, Yoke-Eng Chiew, Paul Harnett, 

Michael Friedlander, Michael Quinn, Jan Pyman, 

Stephen Cordner, Patricia O’Brien, Jodie Leditschke, 

Greg Young, Kate Strachan, Paul Waring, Walid Azar, 

Chris Mitchell, Nadia Traficante, Joy Hendley, Heather Thorne, 
Mark Shackleton, David K. Miller, Gisela Mir Arnau, 

Richard W. Tothill, Timothy P. Holloway, Timothy Semple, 
Ivon Harliwong, Craig Nourse, Ehsan Nourbakhsh, 

Suzanne Manning, Senel Idrisoglu, Timothy J. C. Bruxner, 
Angelika N. Christ, Barsha Poudel, Oliver Holmes, 

Matthew Anderson, Conrad Leonard, Andrew Lonie, 

Nathan Hall, Scott Wood, Darrin F. Taylor, Qinying Xu, 

J. Lynn Fink, Nick Waddell, Ronny Drapkin, Euan Stronach, 
Hani Gabra, Robert Brown, Andrea Jewell, 

Shivashankar H. Nagaraj, Emma Markham, Peter J. Wilson, 
Jason Ellul, Orla McNally, Maria A. Doyle, Ravikiran Vedururu, 
Collin Stewart, Ernst Lengyel, John V. Pearson, Nicola Waddell, 
Anna deFazio, Sean M. Grimmond & David D. L. Bowtell 


Nature 521, 489-494 (2015); doi:10.1038/nature14410 


In this Article, the affiliations of authors Michael Quinn and Orla 
McNally should read “**Department of Obstetrics and Gynaecology, 
The University of Melbourne, and The Royal Women’s Hospital, 
Parkville, Victoria 3052, Australia”. Their affiliations have been 
corrected in the HTML and PDF versions online. 
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CORRIGENDUM 
doi:10.1038/nature15720 


Corrigendum: Improving survival 
by exploiting tumour dependence 
on stabilized mutant p53 for 


treatment 


E. M. Alexandrova, A. R. Yallowitz, D. Li, S. Xu, R. Schulz, 
D. A. Proia, G. Lozano, M. Dobbelstein & U. M. Moll 


Nature 523, 352-356 (2015); doi:10.1038/nature14430 


In this Letter on page 353, the words ‘substrate Hsp90’ should have 
been included in this sentence as follows: “Likewise, histone deacety- 
lase inhibitors, including FDA-approved SAHA, are promising anti- 
cancer drugs whose actions involve hyperacetylation of histone and 
select non-histone targets including HDAC6 substrate Hsp90, thus 
indirectly inhibiting Hsp90 (ref. 21)”. This has been corrected in the 
online versions. 
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CORRECTIONS & AMENDMENTS 


ERRATUM 
doi:10.1038/nature15718 


Erratum: Arithmetic and local 
circuitry underlying dopamine 
prediction errors 


Neir Eshel, Michael Bukwich, Vinod Rao, Vivian Hemmelder, 
Ju Tian & Naoshige Uchida 


Nature 525, 243-246 (2015); doi:10.1038/naturel14855 


In this Letter, the x-axis labels of Fig. 3b and d were incorrect; for both 
panels, the labels should read: “Time from odour onset (s)”. Figure 3 
has been corrected in the HTML and PDF versions online. 
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ERRATUM 
doi:10.1038/nature15719 


Erratum: A positional Toll receptor 
code directs convergent extension 


in Drosophila 


Adam C. Pare, Athea Vichas, Christopher T. Fincher, 
Zachary Mirman, Dene L. Farrell, Avantika Mainieri 
& Jennifer A. Zallen 


Nature 515, 523-527 (2014); doi:10.1038/nature13953 


In this Article, two errors were introduced into the figures during the 
production process. In Fig. 3i, the x axis should alternate “WT, 2, WT, 
2’ rather than “WT, 6,8, WT, 6,8’, and in Fig. 2g, the 6,8 bar should 
have one asterisk, rather than two. These errors have been corrected in 
the online versions of the paper. 
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CARBON CONTROL How a careful academic EQUAL OPPORTUNITIES Women, science and a | NATUREJOBS For the latest career 
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BY PAUL SMAGLIK 
wo years into his PhD programme 
in immunology, Sudarshan Anand 
tt learned that his adviser was leaving 
FMBLE the Mayo Clinic in Rochester, Minnesota, 
>. for Johns Hopkins University in Baltimore, 
Maryland. So where did that leave Anand? 
44 If he stayed at Mayo, he would need to find 


_ 


-™ < , 


LAB TRANSITIONS 


The bumpy road 
to relocation 


Faced with a need to move lab, scientists should consider as 
early as possible how to effect a smooth transition. 


another mentor and probably another PhD 
project. But if he followed his adviser to Bal- 
timore, he would have to rebuild his support 
system. “It is easier to make friends in the first 
year of grad school,” he says. “Joining a new 
programme in year three, you need to be more 
proactive and make friends outside of the lab” 

Anand followed his instincts — and his 
mentor — to Maryland. By doing so, he kept 
a research project and adviser whom he liked, 
but he was delayed by about a semester owing 
to the need to retake coursework. He also had 
to recruit a new thesis committee. “The fac- 
ulty didn't know me. I just dropped in and said, 
‘Hey, I would like you to be on my committee’ 
That was a little tricky.” The experience helped 
Anand to prepare for two subsequent reloca- 
tions: one to the University of California, San 
Diego, for a postdoc in 2007, and another in 
2013 to take a tenure-track position at Oregon 
Health & Science University in Portland, 
where he still works today. 

Science is a mobile enterprise, and research- 
ers at any stage of their careers could suddenly 
face the prospect of packing up and moving 
to a different institution, nation or even con- 
tinent. The process is rarely easy, even for 
those who don't have the headache of moving 
an entire laboratory. For graduate students, 
a move may mean repeating coursework. 
For postdocs, it could mean losing access to 
painstakingly collected data, animal mod- 
els or reagents, or sacrificing time to create 
back-up animal models or cell cultures. Sen- 
ior scientists might need to recruit and train a 
new lab team. All those factors have a role ina 
scientist’s decision on whether to move and, 
if so, how. 

For Anand, things worked out well: his team 
at Mayo was close to publishing a paper when 
the time came to move, which helped to give 
him firm scientific footing. “Tt did wonders for 
my confidence,” Anand says. His remaining 
time at Johns Hopkins felt more like a post- 
doc, he says, because he had time to do experi- 
ments that weren't essential to his dissertation. 
He passed his qualifying exams with minimal 
stress and few regrets about his circuitous > 
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> educational path. “You learn a lot about 
yourself by how you handle curveballs,’ he says. 
But it isn’t always such smooth sailing. 
Researchers who are faced with moving 
lab — whether to follow a mentor, because of 
a calamity or to snag a fellowship or research 
post — need to identify and maintain what they 
have already established before they consider 
recreating that situation in a new environment 
(see ‘Five simple steps for a fluid transition). 


ACAN OF WORMS 

Developmental biologist Phil Newmark left 
his first postdoc at the University of Barcelona 
in Spain to continue his work at the Carnegie 
Institution for Science in Baltimore. Naturally, 
he took his research collection of planarian 
flatworms (Schmidtea mediterranea) with him. 
But within two years, his entire worm colony 
had died off as a result of sudden problems 
with the in-house water-purification system — 
right as the team was making some important 
technical breakthroughs. 

These were not just any worms. The Spanish 
species differs from the North American one 
in that it reproduces asexually, has a smaller 
genome and isn't easily procured. “One cannot 
simply buy these animals from a supplier,’ New- 
mark says. “We could have lost years.” Panicked, 
he and his postdoc adviser flew back to Spain to 
make a pilgrimage to the broken fountain from 
which he had gathered his first batch. They were 
relieved to find the same type of planarian still 
dwelling in the standing water. 

The problem recurred a few years later — 
again as a result of abrupt changes in local 
water quality — when Newmark moved to the 


University of Illinois at Urbana—Champaign. 
This time, however, Newmark had a safety 
net of back-up worms, which gave him time 
to develop defined culture conditions, using 
ultrapure water as the starting point. That 
was fortuitous, because by then workers had 
repaired the fountain in Spain. It is no longer a 
habitat for planaria. 

Biomolecular engineer W T. Godbey also 
had to adapt quickly to unforeseen disaster. 
In 2005, Hurricane Katrina destroyed his lab, 
forcing him to move temporarily from Tulane 
University in New 


Orleans, Louisiana, “The first ‘few 
se — sity weeks were the 
x ae sir scariest because 
ne on ils gracuate no one one knew 
students, Xiujuan 
where anyone 


Zhang, decided to 
follow him — but 
she had to find him 
first. This was not an 
easy task in an age before ubiquitous mobile 
phones, and Tulane’s e-mail server was down 
because of the flood. She eventually tracked 
him down through his mother. 

“The first few weeks were the scariest 
part, because no one knew where anyone 
was — home, on vacation, dead,’ Godbey 
recalls. Once the flood waters had receded, he 
visited his lab at Tulane to see how much dam- 
age the hurricane had caused. His plasmid 
samples had been wrapped in garbage bags, 
boxed and shoved into a hot, mouldy freezer 
in a sweltering laboratory — yet fortunately, 
none had seriously degraded. Back at Rice, 
he and Zhang grew the DNA segments into a 


was — home, on 
vacation, dead.” 


SMOOTH MOVES 


larger library, then sequenced the plasmids to 
ensure that none had been compromised. None 
had, but Godbey knows now that back-up sup- 
plies — and disaster planning — are essential. 


COUNTRY CONUNDRUMS 

Even the most careful plans can be thrown out 
of whack when complications arise, especially 
for senior scientists. Molecular biologist Josh 
Brickman began a relocation in 2011 from the 
University of Edinburgh, UK, to the Univer- 
sity of Copenhagen, where he had accepted a 
group-leader post at the then-newly created 
Danish Stem Cell Center. He had to transport 
multiple types of animal, recreate several mouse 
lines, resettle half-a-dozen lab staff and, for a 
time, supervise labs in two countries to fulfil 
dual grant commitments. “It was an experi- 
ence,’ he says. 

Different animal-housing conventions at the 
two facilities intensified the stress inherent in 
moving more than 100 frogs and 6 lines of trans- 
genic mice. The Edinburgh facility had housed 
the mice in open-topped cages, which risked 
exposing them to pathogens. The animal facil- 
ity in Copenhagen, by contrast, had closed cages 
that were considered pathogen-free — which 
meant that the relocated mice could not be 
placed directly into them. Instead, the animals 
were mated, and their embryos were removed 
and transferred to surrogate mothers that had 
been raised in the Copenhagen pathogen-free 
lab. “It took much longer than we had expected,” 
Brickman says. “And we had problems doing 
mouse experiments during the process” 

Transporting the embryonic-stem-cell 
lines — which according to Brickman’ estimates 


Five simple steps for a fluid transition 


There is no one-size-fits-all approach to 
moving lab, but those who have successfully 
navigated the ordeal can offer useful advice. 


@ Create back-up plans for resources. 

Phil Newmark’s planaria worms survived a 
move from Barcelona, Spain, to Baltimore, 
Maryland — until a change in the lab’s 
water-purification system killed the lot. Now 

a developmental biologist at the University 

of Illinois at Urbana-Champaign, Newmark 
confirmed for himself that when moving it is 
crucial to consider resources — in his case, 
both his live specimens and water quality at his 
new site. “After our die-off, | made a big effort 
to make multiple clonal lines and establish 
long-term colonies,” he says. 

@ Make sure that your new lab is ready. 
Construction delays at biomolecular engineer 
Josh Brickman’s new lab at the University of 
Copenhagen meant that his equipment — 
including animal models — had to be housed 


in temporary facilities. That created an extra 
step in an already complicated move from the 
University of Edinburgh, UK. “If | were faced 
with doing it again, | would certainly move 
only once | knew the facility was finished,” says 
Brickman. 

@ Keep lab staff in the loop. Everyone should 
be informed as early as possible that you 

are considering a move, no matter your 
career stage. Talking openly about plans 

and considering how they might affect lab 
members and the lab’s work is important 

for a smooth transition, whether or not 

the group is going, too. “They may have 
situations you’re not aware of,” says W T. 
Godbey, an engineer at Tulane University in 
New Orleans, Louisiana. He had to move his 
lab temporarily to Rice University in Houston, 
Texas, after Hurricane Katrina in 2005. 

@ Don’t dismiss your gut feelings. Sudarshan 
Anand, a biologist at the Oregon Health & 
Science University in Portland, followed his 
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adviser from the Mayo Clinic in Rochester, 
Minnesota, to Johns Hopkins University in 
Baltimore. Although he acknowledges that 
some people rely on rational processes, 

such as cost-benefit analysis, to decide 
whether to relocate, that did not work for 

him. “Sometimes it’s better to go with your 
instincts,’ Anand says. He also recommends 
that early-career researchers avoid over- 
emphasising professional gains when mulling 
over a move. “You need to evaluate your 
personal situation,” he says. In his latest move, 
he had to consider his wife’s employment 
prospects as well. 

@ Line up a point person. Brickman says 
that having a local coordinator on board to 
handle logistical details is invaluable. “We 
had an amazing administrator who was on 
the phone with my students and postdocs 
when their apartments fell through, or when 
we had any problems with animals,” he says. 
“That saved our bacon many times.” P.S. 


WEST VIRGINIA UNIV. 


represented more than 100 person-years of 
work — also proved cumbersome. First, he 
had to ensure that every line was duplicated 
in Edinburgh. Then he and his lab members 
arranged for the cell lines and reagents to be 
stored properly in liquid nitrogen and packed 
carefully into supercooled containers. The 
group loaded a truck with the cell lines and 
reagents and then flew to Copenhagen to 
meet it and ensure that the biological mate- 
rial was still stable. 

Complicating matters further, the MRC 
Centre for Regenerative Medicine at the 
University of Edinburgh, where Brickman’s 
lab was based, was moving to a new build- 
ing at the time that the stem-cell centre in 
Copenhagen was under construction. When 
he learned that the opening in Copenhagen 
would be delayed, Brickman felt it best to 
move the bulk of his lab to temporary facili- 
ties in Denmark rather than to the new build- 
ing in Scotland — even though he would later 
have to transfer again to the permanent lab. 

Managing lab members in two sites also 
proved challenging. Brickman had landed 
a collaborative UK grant before he left, so 
he hired a new postdoc to work at his Edin- 
burgh lab and continued to manage three lab 
members who remained there. He could not 
directly supervise his new recruits much of 
the time, and so missed out on day-to-day 
knowledge of how his Edinburgh lab func- 
tioned; he commuted between Scotland 
and Denmark weekly for three months but 
worked mainly in Denmark over the next 
two years. The protracted move, he says, may 
have delayed the publication of papers — an 
unfortunate result for his junior co-authors, 
although he says that the papers were eventu- 
ally accepted into high-impact journals. 

Despite all the snags, much went right, 
Brickman says. He credits administrative 
support in both Edinburgh and Copenhagen 
for the smooth relocation of his lab group. 
“All of my people were able to move both 
work and personal lives,” he says. “None of 
them ended up homeless, despite moving 
to anew country where they didn't speak 
the language and ina city where it is almost 
impossible to find rental apartments.” In the 
end, clearing the many logistical hurdles 
proved worthwhile, he says, because the 
new stem-cell centre’s strengths outweigh 
the hassles that he underwent to join it. 

There is no way around it — moving lab, 
whether within a university or to another 
country, is gruelling, stressful and likely to 
include disaster or catastrophe. Ultimately, 
however, no one can plan for everything, 
and adaptability is perhaps the most use- 
ful resource. “Iam much more unflappable 
now,’ says Godbey. “The more extreme the 
situation, the more flexible you need to be: m 


Paul Smaglik is a freelance writer in 
Milwaukee, Wisconsin. 


TURNING POINT 


CAREERS 


Daniel Carder 


Daniel Carder, director of the Center for 
Alternative Fuels, Engines and Emissions 
at West Virginia University (WVU) in 
Morgantown, was on a team whose work 
led to Volkswagen's admission that some of 
its diesel vehicles contained software able to 
sidestep emissions tests. 


What does the centre do? 

We have done vehicle-emissions testing and 
technology development for 25 years. We 
designed the first mobile diesel-fuel measure- 
ment systems, which use detectable carbon 
emissions to determine consumption. In 
addition to fundamental research, we produce 
open data on how new automotive technolo- 
gies, such as clean diesel, prove in practice. We 
also try to make them more efficient. 


Can you describe your research? 

I have bachelor’s and master’s degrees in 
mechanical engineering from WVU, and will 
complete my PhD this year. Iam involved with 
the measurement and control of emission par- 
ticulates related to diesel-fuel usage. My thesis 
work led to the adoption of US federal stand- 
ards for particulate emissions in underground 
mine extraction. That technology controls 
highway and off-highway emissions. 


Did you expect to find any problems? 

Quite the contrary. In 2013, we received 
US$69,000 from the International Council on 
Clean Transportation to test the diesel emis- 
sions of two Volkswagen models. We expected 
to show that clean diesel fuel was doing a good 
job. We had seen successful demonstrations 
of the same type of technology in the bus and 
tractor markets and wanted to translate them 
for passenger-vehicle manufacturers. 


What were your first experimental results? 
We believed that these systems would reduce 
emissions from 1,000 parts per million of par- 
ticulates to 10-20 p.p.m. When we saw ini- 
tial data for the Volkswagen vehicles, the first 
thing we did was scrutinize our work. Did we 
make a mistake with calibration? We double- 
and triple-checked our data and procedures. 
After several quality-control exercises, we 
were assured that our findings were valid. 
But it wasn't in our contract to find out why. 


Were the data made public? 

Yes. Marc Besch, also a graduate student, 
presented the discrepancies in 2014 at a work- 
shop in San Diego, California, attended by 
people from the US Environmental Protection 


Agency, petroleum companies and engine 
manufacturers. Before we left the conference, 
we were contacted by Volkswagen asking about 
our techniques and data-collection methods. 
It seemed like a normal fact-finding mission. 


When did you realize that this was a big story? 
I was in the lab on 18 September when the 
news broke. My hands were filthy from 
working on diesel engines. My phone was ring- 
ing continuously, but I didn’t recognize the 
numbers — reporters were calling. We were 
blindsided. 


Can you describe the media attention? 
Constant. I am the poster boy for why everyone 
should have media training. It’s been trial by fire. 


How has the discovery affected your work? 
You never know when routine research could 
have a major impact. And it has provided a good 
way to talk to our students about the quality and 
custody of data. It is satisfying and rewarding to 
be recognized for work behind the scenes. 


Do you have any concerns about the fallout? 
Weare the data collectors who will develop and 
refine new technologies. One concern is the 
perception that our objectivity could be com- 
promised. We have strived to get industry to 
work with academia on emissions-control tech- 
nology and policy issues because we believe that 
researchers cannot sit in ivory towers. 


What should the public know about your work? 
As support for earmark funding has waned for 
centres such as ours with a mission that benefits 
the nation, we have shown that they have merit. 
It’s difficult to keep ventures like this afloat with- 
out congressional support. m 


INTERVIEW BY VIRGINIA GEWIN 


This interview has been edited for length and clarity. 
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Ua SCIENCE FICTION 


AGE PROGRESSION 


BY SUSANA MARTINEZ-CONDE 


THEN: Julia thought at first that the baby 
had slept through the night. She wanted to 
go back to sleep, but her breasts felt uncom- 
fortable and milk-heavy, so she got up 
instead. 

Inside the nursery, the familiar smells 
were soothing. The sweet buttery scent that 
was most intense over the soft spot on 
Rose’s head; the vanilla fragrance 
from her bedtime lotion. 

The room was too quiet. 

She listened hard for the 
baby’s breathing, willing her own 
lungs not to exhale, but only the ringing in 
her ears shattered the silence. She pawed at 
the light switch. Then she was standing over 
the crib, not knowing how she got there, 
tearing the blanket off her daughter's face. 


NOW: Rose’s wedding is next month, and 
Julia is helping with the flower arrange- 
ments. People say that she spends too much 
time obsessing about Rose, but she doesn't 
care to listen. Not a single one of her well- 
meaning advisers has walked in her shoes. 

She regrets that Rose’s father will not 
attend the simple but charming ceremony, 
but it can’t be helped. He's been out of the 
picture since Rose was a little baby. 


THEN: Julia watched the baby’s head flop 
back and forth. She knew not to shake an 
infant but could not think of stopping; 
even less imagine the rest of her life after 
stopping. She kept shaking Rose until her 
husband, awoken by her screams, ripped the 
baby from her arms to attempt CPR. He kept 
at it, too, until the medics arrived and took 
Rose from him. 


NOW: Julia never tires of looking at Rose's 
pictures. Her scrunched-up face minutes 
after she was born. The first toothless smile. 
The first shaky steps on chubby legs. Julia 
has documented every Halloween costume, 
every school graduation. The photos and 
home videos span more than two decades. 
She is amazed to think of how far she and 
Rose have come. 


THEN: The grief counsellor at the hospi- 
tal had arranged for the portrait artist, and 
Julia had been too numb to object. Now the 
girl lifted her eyes from her tablet and stud- 
ied Rose. Julia tried seeing her baby, finally 
free from tubes and probes, through the 


Family connections. 


stranger's eyes, and failed. The little body on 
her lap was growing foreign to her, even as 
her shirt became damp with unsuckled milk. 
Her eyes brimmed over and she thought it 
odd that her tears would still fall after she 
was no longer sobbing. She felt hollowed out 
and filled with liquid grief, spilling from her 
in every way. 


NOW: The first portrait had been a miracle, 
a conduit that bridged the broken landscape 
of Julia's universe. Smiling at the memory, she 
opens the file in her computer and watches 
Rose’s chest move up and down in her sleep, 
her eyelids fluttering as if about to wake. 


THEN: Julia had resisted looking at the 
finished picture on the artist’s tablet, but 
eventually relented, her mind seeking diver- 
sion from the horror on her lap. Then she 
couldn't stop watching. The baby’s cheeks 
were flushed, and the slackness of her pos- 
ture suggested sleep from which she might 
awake at any moment. “She looks alive; Julia 
said, acknowledging the young woman for 
the first time. Understanding the imploration 
in the mother’s words, the artist took back the 
tablet and resumed her work. 

Where Julias husband had failed, and then 
the medics and doctors had failed, the art- 
ist succeeded. The newly animated portrait 
showed Rose’s body moving in synch with 
her respiration, her rosebud mouth working 
as if dreaming about nursing. 


NOW: When a baby dies, parents mourn not 
only the sweet infant that was, but also the 
spirited schoolgirl, the awkward teenager, 
the assured young woman that might have 
been, that should have been. They grieve 
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for their lost grandchildren. Julia no longer 
sorrows. She’s not haunted by what ifs. The 
answers have unfolded in front of her for 
more than 20 years. 


THEN: The hospital portrait, and the age- 

progressed reconstructions that followed, 

opened an irremediable schism in Julia’s 

marriage. Her husband refused to partake 

in the immersive scenarios that later tech- 

nological refinements allowed. His 
desire for another ‘real life’ child 
felt like a betrayal of Rose. 


NOW: Julia’s ex-husband has 
been happily married for most of Rose's 
life. He doesnt speak to his first wife or first- 
born child. Rose’s half-siblings have never 
met her. 


THEN: After her husband left, Julia con- 
sulted with the best virtual developers and 
age-progression biographers. She decided 
that Rose should have an ordinary life, not 
a fairy-tale one. Her daughter would expe- 
rience the ups and downs, the minor disap- 
pointments and small joys that a regular child 
would. And she, her mother, would not run 
Rose's life as it evolved — she gave creative 
control to the company that she hired to age- 
progress Rose. 


NOW: Julia is not a fool. She knows — how 
could she ever forget — that her baby died 
halfa lifetime ago. And yet. 

She remembers a college lecture about 
multiple worlds. The professor said that our 
universe splits every time we make a deci- 
sion. When we park our car in the left spot 
rather than the right one, when we fail to 
set the alarm clock and sleep in instead of 
waking up early, when we stay in bed and 
don’t check on the baby in the middle of the 
night, the universe breaks in half. There are 
as many universes as choices. 

Whos to say that the virtual life Julia has 
created for Rose is not a window into the life 
that another Rose is living elsewhere, in a 
more benign reality? Julia likes to think it is. 

She goes back to the flower arrangements. 
Roses, of course. Her daughter is marrying 
young, but she may not wait too long to 
make Julia a grandmother. 

She can’t wait to meet her grandkids. = 


Susana Martinez-Conde is a scientist who 
writes for fun. Ina parallel universe, she’s a 
writer who dreams about doing science. 
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the world — and no wonder. Some 1.7 million women 

were diagnosed with the disease in 2012, making it 
a global priority. Researchers have made great strides in the 
treatment of some types of breast cancer (see page S102), but 
the battle continues on many fronts. Perhaps the most exciting 
area of research is immunotherapy (page S105), whereby 
scientists are attempting to harness the body’s own immune 
system to fight and prevent malignancies. The success of 
biological drugs in the treatment of people with a specific 
tumour demonstrates the potential of targeted treatments 
(page S110). Meanwhile, researchers are using big data to 
identify new targets and treatment strategies (page S108). 
But not every cancer needs treatment, and the hunt is on for 
biomarkers that can sort the cases that demand action from 
those that are better left alone (page S114). Research on the 
interplay between environment and genes has illuminated the 
workings of the disease and helped to identify who is really 
at risk (page S116). Although many women try to protect 
themselves through regular mammograms, worries about 
false alarms and overdiagnosis have spurred efforts to reform 


B= cancer is perhaps the most-studied malignancy in 


screening to focus on the cancers that really matter (page S118). 


However, each woman needs to decide for herself whether to 
be screened (page S104). And patients should have a say in the 
course of treatment. Some may want to go down the aggressive 
path no matter what the side effects, and others prefer to take 
the slow, cautious route. With many of the world’s top minds 
working on their behalf, they should not feel alone in the fight. 

We are pleased to acknowledge the financial support of the 
Medipolis Proton Therapy and Research Center, a part of 
the Medipolis Medical Research Institute, in producing this 
Outlook. As always, Nature retains sole responsibility for all 
editorial content. 


Chris Woolston 
Contributing editor 
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in| A tumour through time 
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{200 MILLION YEARS AGO 


~~ 


Milk-producing skin glands evolve dur- 
ing the Late Triassic period in a group of 
egg-laying proto-dinosaurs called cyno- 
donts. Instead of sweat or scent, these early 
mammaries produce a simple milk to 
supplement the diet of hatchlings. Over 
time, these glands grouped together 

under nipples and began responding to 
sex hormones such as oestrogen. 


> a 


A medical text from ancient Egypt contains 
the first known reference to breast cancer. It 
describes 48 surgical problems, including 
“bulging tumours on the breast”. The 
unknown author describes “swellings 
on [the] breast, large, spreading and 
hard: touching them is like touching 

a ball of bandages”. It would be some 
time before surgeons could offer any- 
thing beyond a diagnosis. The author's 
take on treatment: “there is nothing”. 


> a 


Doctors begin to embrace the possibilities — and limitations — of surgery for breast 
cancer. In eleventh-century Moorish Spain, the noted surgeon Abu al-Qasim al- 
Zahrawi writes that breast cancer could be cured when “complete removal [of the 
tumour] is possible, and especially when in the early stage and small. But when it is 
of long standing and large, you should leave it alone. I myself have never been able to 
cure such, nor have I seen anyone else succeed before me”. 


~ @ 


Emil Grubbe, an electronics enthusiast and medical student at the Hahnemann Medical 
College in Chicago, Illinois, assembles one of the earliest X-ray devices. A colleague remarks 
on the radiation burns on Grubbe’s hands and suggests that X-rays might be used against 
unhealthy tissue. The first recorded instance of radiation oncology occurs later that year when 
Grubbe’s machine is used to irradiate a breast carcinoma in a woman named Rose Lee. She 
reportedly received several hour-long treatments and died shortly thereafter. 
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BOX OF BONES ~2200 Bc 


During a 2015 excavation 
of the Egyptian necropo- 
lis Qubbet el-Hawa 
near Aswan, a team led 
by Egyptologists and 
anthropologists from 
the University of Jaén in 
Spain discovered a coffin 
with remarkable contents: 
bones of a woman, show- 
ing the tell-tale deformations of 

metastatic breast cancer (pictured). She 

died some 4,200 years ago, making her the first 
known victim of the disease. 


DEEP CUT 1590 


French surgeon Barthélémy 
Cabrol suggests that advanced 
breast cancer could be cured by 
removing the underlying chest 
muscles along with the breast. 
Others will attempt variations 
on this idea for centuries, with 
often dismal results. 


KNIFE SKILLS ~1894 


In the United States, William Halsted and 
Willy Meyer independently develop radical mas- 
tectomy to treat advanced breast cancer. The 
operation removes the entire breast, the pectora- 
lis major and minor muscles directly underneath, 
and the axillary lymph nodes from the armpit. 
The surgery offers people with advanced breast 
cancer the first serious hope of a cure. 


X-RAY VISION 1956 


Robert Egan starts to develop effective mam- 
mography, an X-ray examination that can detect 
breast tumours that are too small to be felt. 
By the 1970s, mammography has become a 
popular screening test for women. 


DOUBLE STANDARD 1975 


The National Surgical Adjuvant Breast and 
Bowel Project shows that surgery combined 
with chemotherapy works better against breast 
cancer than surgery alone. Combined treatments 
become the standard of care. 
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BREAST CANCER iuuieltls 


Breast cancer, one of the most common and deadly malignancies, has undoubtedly plagued 
humans since the dawn of our species. The history of the fight against the disease is one 


of lurching progress against a backdrop of misery. But recent decades have seen greatly 
improved treatments and increased survival. By Will Tauxe. 


DRUG STOP 1977 


The oestrogen-blocking drug tamoxifen (pic- 
tured) is approved in the United States as a 
treatment for advanced metastatic breast can- 
cer. Today, tamoxifen is one of many hormone- 
blocking drugs used worldwide to treat — and 
in some cases prevent — certain types of breast 
cancer. 


BUSTED FLUSH 1995 


The US Nurses’ Health Study reveals that hor- 
mone replacement therapy (HRT) increases the 
risk of developing breast cancer. At the time, HRT 
was popular both to treat and prevent menopau- 
sal symptoms among post-menopausal women. 
Today, HRT is no longer routinely recommended 
for long-term use in post-menopausal women. 


LESS IS MORE 2002 


Two large studies show that people with breast 
cancer live just as long after small lumpectomy 
surgery combined with radiation as they do after 
radical mastectomy. Further studies show that 
only a narrow 2-millimetre ‘clean margin’ of 
healthy tissue needs to be removed along with 
the cancer in a lumpectomy. 


TIMES CHANGE 2009 


The US Preventive Services Task Force recom- 
mends that women should be offered a mam- 
mogram first at age 50, and then every other 
year after — a departure from previous advice 
to start annual screening at age 40. The change 
sparks debate about the balance between the 
harm of unnecessary treatment and the risk of 
undiagnosed cancers. 
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> a 


Mary-Claire King and colleagues (J. M. Hall et al. Science 
250, 1684-1689; 1990) use samples of DNA from families 
with a history of breast cancer to establish a link between 
mutations in a tumour-suppressing gene she names 
BRCA1 and an elevated risk of breast and ovarian cancer. 
The discovery changed thinking about genetic influences 
on cancer. Further research showed that mutations in 
another gene, BRCA2, could also increase cancer risk. 
Today, some women who test positive for these mutations 
— including actress Angelina Jolie (pictured) — choose 
to have their breasts removed to reduce their cancer risk. 


> a 


Charles Perou and colleagues (C. M. Perou et al. Nature 406, 747-752; 2000) show that breast 
cancers can be grouped into clinical subtypes on the basis of mutations in their DNA. Analysis 
of tumour-cell DNA enables doctors to choose treatments that are more likely to be effective. 
The subtype of ‘triple-receptor-negative’ breast cancer is particularly difficult to treat because 
these cancers do not respond to signals from any of the breast growth hormones: oestrogen, 
progesterone and human epidermal growth factor 2. 


> Ba 


In the case Association for Molecular Pathology v. Myriad Genetics, the US Supreme Court 
overturns molecular-diagnostics company Myriad’s patents on the genetic codes of BRCA1 
and BRCA2. The court's decision states that “a naturally occurring DNA segment is a product 
of nature and not patent eligible’, although tests to find specific harmful mutations are still 


considered patentable. 
i > 
> A 


In 2015, breast-cancer survival rates are at their highest ever. More than 6 million people 
worldwide are still alive 5 years after being diagnosed with breast cancer, although survival 
rates continue to lag in areas without reliable access to advanced medicine. Breast cancers are 
now treated with tailored combinations of surgery, chemotherapy and radiation, and contin- 
ued research is making treatments more precise and minimizing side effects. 
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UU TRelt@ BREAST CANCER 


PERSPECTIVE 


DC, London or my home city of Sydney, where women regularly 

have screening mammograms. In the waiting room sit women 
in their 40s, 50s and older who have breast cancer that was found by 
screening. They are scared and deeply uncertain about their future. 
But do they all need to be there? No, because some of them have been 
‘overdiagnosed’' * — they are about to have treatment for a breast can- 
cer that would not have caused any health problems had it been left 
undetected and untreated. 

In an ideal scenario, breast-cancer screening would find poten- 
tially lethal breast cancers before they have caused symptoms (such 
as a lump). Finding these cancers early would permit the best pos- 
sible treatment and mean fewer breast cancer deaths”. This was the 
premise that underpinned screening programmes, but, increas- 
ingly, research shows that the picture is much more complicated. 

Breast cancer takes many forms — some 
indolent and harmless, and some very aggres- 
sive and lethal that grow and spread rapidly. 
Because a screening mammogram is a snapshot 
in time, it is more likely to catch a slow-grow- 
ing cancer than a fast-growing one’. In other 
words, it leads to overdiagnosis because of its 
tendency to detect the cancers that are unlikely 
to be harmful. 

As a woman in my 50s, this is not just an 
academic issue. I have two options: I can have 
regular screening (every 2 years from age 50 to 
74 is the recommended schedule in Australia), 
or I can choose not to. To decide, I need to 
consider the potential outcomes. One possi- 
bility is that my mammograms will always be 
normal and that will be that. Or perhaps one 
of my mammograms will not be normal and I 
will have further tests, which will show I do not have cancer. This 
is called a false positive. If I regularly go for screening for 20 years 
or so, there is more than a 40% chance that I will have one of these 
scares”. I would feel anxious, and it might take me some time to 
recover, but, on its own, I do not think the threat of a false alarm 
would stop me from being screened. The third possibility is that I 
am diagnosed with breast cancer. Like the women in the waiting 
room, I would be advised to have surgery (to remove the cancer or 
the entire breast), and probably radiotherapy, along with hormone 
therapy for 5 or more years. 

So why would I even consider not getting screened? Am I not 
worried about dying of breast cancer? Yes, of course I am. But ’m 
also worried by the possibility that I could be seriously harmed by 
the treatment of a cancer that would never have affected my health. 

Radiotherapy will increase my risk of heart disease, especially if 
the breast cancer is on the left, the same side as the heart. Cardio- 
vascular disease killed my mother and three of my grandparents, 
so that is a big concern for me. I would almost certainly experi- 
ence some of the common side effects of hormone therapy, such 
as tamoxifen, including mood problems, low libido and vaginal 
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BECAUSE A 


SCREENING 
MAMMOGRAM 


IS A SNAPSHOT IN 
TIME, IT 1S MORE 
LIKELY TO CATCHA 
SLOW GROWING 


CANCER. 


») The risks of overdiagnosis 


Screening mammograms catch some cancers that pose little threat. 
Alexandra Barratt explains why she may decide to skip the scans. 


dryness — and an increased risk of blood clots and stroke. Both my 
daughters would feel anxious and would, forever after, have a ‘fam- 
ily history of breast cancer. Importantly to me, the psychological 
impact and physical risks of being treated for an overdiagnosed 
cancer would start straightaway. As I get older, I find that having 
good health and enjoying my life today becomes more and more 
important than the future. 

At present, scientists and doctors cannot identify an ‘overdiag- 
nosed’ cancer at the individual level because we do not have tests 
that can reliably distinguish between breast cancers that will progress 
to cause health problems and those that will not. Overdiagnosis can 
only be inferred from population-level statistics, so I could never be 
certain whether I had been overdiagnosed or not. I’m not sure that 
is a Pandora’s box that I’m willing to open. This is the conundrum at 
the heart of overdiagnosis, and the only way I can avoid it is by not 
getting screened in the first place. 

I still might, however, decide to get screened 
because of the small chance that it could prevent 
me dying from breast cancer. By some estimates, 
screening 1,000 women regularly for 20 years 
might prevent 4 breast cancer deaths’. Some esti- 
mates are even lower*’. That means that there is 
roughly 0.4% chance that screening will make 
the difference between me dying of breast cancer 
and not. Estimates vary, but I’m more likely to be 
overdiagnosed than to have my life saved — three 
to ten times more likely®”. 

To be clear, if I decide not to have screening, 
that does not mean I’m pretending breast cancer 
does not exist. If I notice a change in my breasts 
such asa lump, I will see my doctor without delay 
to have it tested. And if, in the future, the sci- 
ence about breast-cancer screening changes, I 
will carefully consider the new information. For now, I will do what 
I can to minimize my risk of breast cancer by watching my weight, 
incorporating exercise into my life and moderating my alcohol intake. 

Whatever I decide, my decision should not determine that of 
anyone else. Women have different bodies, families, circumstances, 
preferences and fears. What matters is that each woman has access 
to the best information that science can provide to allow her to 
choose wisely what is best for her. = 


Alexandra Barratt is an epidemiologist at the University of Sydney, 
Australia. 
e-mail Alexandra. Barratt@sydney.edu.au 
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BREAST CANCER C 


Another shot at cancer 


Targeting the immune system to fight breast cancer was all but dismissed in the 1990s, but 
the strategy is making a big comeback with the possibility of a breast-cancer vaccine. 


BY CHARLES SCHMIDT 


hile on her hospital rounds at 
the Walter Reed National Mili- 
tary Medical Center in Bethesda, 


Maryland, Elizabeth Mittendorf encoun- 
tered a patient whose story is still fresh in 
her mind 14 years later. The woman had been 
successfully treated for breast cancer more 
than 15 years earlier, but the disease had 
returned. “And I wondered, how is it possi- 
ble that someone beats breast cancer only to 
face it again?” recalls Mittendorf, now a sur- 
gical oncologist at the MD Anderson Cancer 


Center in Houston, Texas. “To me this could 
only mean that this woman’s immune system 
had failed her.” 

Mittendorf has since dedicated much of 
her career to breast-cancer immunotherapy 
— a field that is just starting to hit its stride. 
Immunotherapy drugs boost the body’s 
inflammatory response against malignant 
tumours. None have been approved for the 
treatment of breast cancer, and many uncer- 
tainties remain, but this is an undeniably 
exciting time. As of August, more than 40 
clinical trials of breast-cancer immunother- 
apies are underway worldwide, and two of 
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them are in phase 3 — the final stage before 
regulatory approval can be sought. People 
with breast cancer already benefit from effec- 
tive treatments, and 5-year survival rates for 
newly diagnosed cases top 90% in the United 
States. But drugs that enhance the immune 
system’s battle against malignancy might pre- 
vent recurrences altogether, says Mittendorf, 
the principal investiga- 
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immunotherapy, see: 
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breast cancer,’ says Mary Disis, an oncolo- 
gist at the University of Washington School 
of Medicine, Seattle. “The immune system 
remembers cancer antigens, and it can seek 
out and kill off metastases anywhere in the 
body, including in the bones and the brain. 
We just have to figure out how to sustain that 
response before it’s exhausted.” 


ALONG HISTORY 

The concept of cancer immunotherapy dates 
back more than a century. The bone surgeon 
William Coley, who worked at what later 
became the Memorial Sloan Kettering Can- 
cer Center in New York, injected his patients 
with a killed bacteria vaccine during the late 
1800s in the hope of stimulating the body’s 
defences. During the 1990s, physicians began 
treating people with cancer with high doses 
of interleuken-2 (IL-2) and interferon-y 
(IFNy) — inflammatory cytokines released 
by infection-fighting white blood cells called 
T cells. Some people with cancer have lived for 
decades with the help of cytokine treatment, 
but because the inflammation that high-dose 
cytokines generate is systemic there can be 
life-threatening side effects, including vascu- 
lar leakage and kidney damage. 

A crucial breakthrough came in 1996, 
when James Allison, an immunologist at 
MD Anderson Cancer Center, and his col- 
leagues showed that it was possible to amplify 
anti-cancer immunity by taking the brakes 
off a molecular checkpoint that would oth- 
erwise dampen the immune response’. The 
body relies on these checkpoints to regulate 
inflammation and limit the risk of autoim- 
mune disease. But as they deliver this essen- 
tial service, checkpoints interfere with the 
immune system’s efforts to destroy growing 
tumours. Allison's research showed that block- 
ing a checkpoint known as CTLA-4 located on 
T-cell surfaces enhances the immune response 
to cancer with fewer side effects than those 
brought on by IL-2 and IFNy. 

In 2011, the US Food and Drug Adminis- 
tration (FDA) approved the CTLA-4 inhibi- 
tor ipilimumab for use in treating advanced 
melanoma. During phase 3 testing, people 
with the disease who were treated with ipili- 
mumab lived an average of four months longer 
than those who went without the drug”. Some 
super-responders are still alive today. 

While Allison targeted CTLA-4, other 
researchers were exploring the clinical pos- 
sibilities of another immune checkpoint on 
T cells: programmed cell death protein-1 
(PD-1). PD-1 binds to its ligand on cancer 
cells forming a complex called PD-1/PD-L1 
and hiding the tumours from the immune 
system. Preventing the formation of these 
complexes has proved beneficial in cancer 
treatment. In December 2014, the PD-1 inhib- 
itor nivolumab became the latest immune 
checkpoint therapy to gain FDA approval, spe- 
cifically for the treatment of metastatic lung 
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Trials of breast-cancer vaccines are underway, but the vaccines may work best when combined with 
other treatments. 


Ca 


cancer’. European approval followed in April. 

At first, there was widespread doubt that 
checkpoint inhibitors would be any use 
against breast cancers. The approach only 
works in tumours that have already been 
invaded by tumour-targeting white blood 
cells called tumour-infiltrating lymphocytes 
(TILs). Melanoma and lung tumours contain 
a lot of TILs, which makes them easy targets, 
but breast cancer tends to have relatively low 
levels of TILs. “So the thinking was that it 
wouldn't respond as well to immunotherapy,” 
says oncologist Leisha Emens at the Johns 
Hopkins University School of Medicine in 
Baltimore, Maryland. “And since breast cancer 
was already being treated with effective drugs, 
it wasn't associated with unmet medical needs 
in the same way that melanoma and lung can- 
cer were.” 

As checkpoint inhibitors make their way 
onto the market, attitudes have clearly shifted. 
“The entire medical community is awaken- 
ing to an appreciation of their potential role 
in treating all cancers,” says Jill O Donnell- 
Tormey, chief executive officer and scientific 
director of the Cancer Research Institute, a 
non-profit organization based in New York. 
“This is why you're seeing all these clinical tri- 
als in breast cancer now.’ 


TRIAL BY VACCINE 

Of the more than 40 ongoing breast cancer 
immunotherapy clinical trials monitored by 
the Cancer Research Institute, roughly two- 
thirds involve vaccines. Breast-cancer vaccines 
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take a number of different forms: Neu Vax, for 
instance, is derived from the cell-surface pro- 
tein HER2, which some breast tumours have 
in large quantities, and is a target for the drug 
Herceptin (trastuzumab). Vaccines are also 
made from cancer-cell DNA, or entire cancer 
cells, and in some cases they are custom-made 
from a patient’s own white blood cells exposed 
to tumour antigens in the laboratory. 

Whatever their origin, cancer vaccines are 
designed to stimulate a particular kind of anti- 
tumour immunity, specifically: type 1 immu- 
nity. Type 1 immune responses depend on 
CD4 T-helper cells that secrete highly inflam- 
matory cytokines, such as IFNy and tumour 
necrosis factor-a (TNF-a). In turn, these 
cytokines activate the CD8 T cells that go on 
to attack and kill cancer cells. 

But vaccines may not work well enough as 
breast-cancer treatments by themselves. Dur- 
ing the phase 1/2 trial of NeuVax, for instance, 
89.7% of treated women achieved 5-year 
disease-free survival compared with 80.2% of 
women who did not receive the vaccine* — a 
result that some found discouraging. Mitten- 
dorf, however, argues that Neu Vax was able to 
cut what would have been a 20% risk of 5-year 
recurrence in half, “which isa fairly impressive 
number that’s certainly of interest to patients.” 

A phase 3 trial, called the PRESENT study, 
will randomize 700 women with early-stage 
breast cancer and low to intermediate HER2 
expression to receive either NeuVax or an 
immune-stimulating chemical called granu- 
locyte-macrophage colony-stimulating factor. 
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According to Mittendorf, if the trial reaches 
its endpoint of 3-year disease free survival, the 
FDA will consider the vaccine for approval — 
results are expected in 2018. 

An increasing number of researchers, how- 
ever, believe that the future of breast-cancer 
immunotherapy lies in giving vaccines and 
checkpoint inhibitors as combined treat- 
ments. In this way, vaccines will stimulate 
T-cell responses in the breast that checkpoint 
inhibitors can then amplify and sustain. Dis- 
cussions about combining Neu Vax with check- 
point inhibitors in future trials are ongoing. 
“T think this will be an efficacious strategy,” 
Mittendorf says. 


HITTING THE TARGETS 

Meanwhile, the field is grappling with how 
to match people with breast cancer with the 
appropriate immunotherapy. The degree to 
which women with the most common form 
of breast cancer, oestrogen-receptor positive, 
will benefit from immunotherapy remains 
an open question, according to Carsten Den- 
kert, a physician at the Institute of Pathology, 
Charité University Hospital in Berlin. Many 
oestrogen-receptor positive tumours, which 
make up 80% of all new diagnoses, are slow 
growing and respond well to existing hormo- 
nal treatments, such as tamoxifen or aromatase 
inhibitors. 

Mittendorf points out that the number of 
women who die of oestrogen-receptor positive 
cancer that has stopped responding to exist- 
ing therapies exceeds the number of women 
diagnosed with more aggressive types of breast 
cancer. The “challenge and opportunity,’ she 
says, is to make oestrogen-receptor positive 
breast cancers better candidates for immune 
therapy, for instance, with combination treat- 
ments. 

According to Christopher Heery, director 
of the Clinical Trials Group in the Labora- 
tory of Tumor Immunology and Biology at 
the National Cancer Institute in Bethesda, 
Maryland, most immunotherapy trials look 
at highly aggressive triple-negative tumours, 
a much needed focus given that these cancers 
lack receptors for oestrogen, progesterone and 
HER2 — targets of existing cancer drugs. Debu 
Tripathy, who chairs the Department of Breast 
Medical Oncology at MD Anderson Cancer 
Centre, points out that the more mutated anti- 
gens that cancer cells carry, the more foreign 
they look to the immune system. And since 
triple-negative tumours express more mutated 
antigens than other breast cancer types, he 
says, they could be especially good candidates 
for immunotherapy. 

Researchers hope to identify biomarkers 
that can help to predict which women with 
breast cancer will respond best to immuno- 
therapy, but reliable candidates remain elusive. 
Checkpoint blockades in breast cancer have for 
the most part been limited to drugs that inhibit 
PD-L1. Examples include MPDL3280A, an 


engineered monoclonal antibody manufac- 
tured by Basel-based Roche and currently in 
phase | testing for triple negative breast cancer, 
and pembrolizumab, manufactured by Merck 
in Kenilworth, New Jersey, which is in phase 2 
trials with the same purpose in mind. It was 
initially thought that better responses would 
correlate with high PD-L1 expression levels; 
so much so that clinical trials have excluded 
women with breast cancer shown to be PD-L1 
negative on screening. But PD-L1 expression is 
dynamic and varies not just between individu- 
als, but also over time. Up- and downregula- 
tion of PD-L1 by cells is a normal response to 
excessive inflammation — cells upregulate it 
to reduce inflammation and downregulate it 
when the inflammation subsides. 

Levels of PD-L1 may vary then depending 
on when samples are taken, and researchers 
still debate whether low PD-L1 levels should 
affect study enrolment. “Some people will 
say “You need high PD-L1 for checkpoint 
inhibitors to work and others will say ‘we ran 
a study and PD-L1 didn’t matter,” Disis says. 
“The fact is that it’s just not a great biomarker.” 
According to Heather McArthur, a medi- 
cal oncologist at Memorial Sloan Kettering 
Cancer Center in New York, there are other 
promising possibilities, including a marker 
for T-cell activation called ICOS, which 
predicts better responses to ipilimumab in 
patients with melanoma and a marker for 
T-cell proliferation called K167. 

Another valuable biomarker, says Denkert, 
could be the amount of TILs in the tumour. 
oestrogen-receptor positive cancers tend to 
have low TIL levels, but that is not necessar- 
ily true of more aggressive malignancies, such 
as triple-negative and HER2-positive breast 
cancer. According to Denkert, about 25% of all 
aggressive breast can- 
cers are “lymphocyte 
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no TILs whatso- 

ever, and the remaining 50% sit somewhere 
in between. Denkert’s research shows that a 
breast tumour’s TIL count is predictive of the 
response to chemotherapy — a high count pre- 
dicts better responses — and he expects it to do 
the same for immunotherapy. “Tumours with 
zero lymphocytes will have only a small chance 
of responding to checkpoint blockade,’ he says. 
Boosting otherwise small amounts of TILs 
using vaccines could be a winning approach, 
Denkert says. But he thinks that in some 
instances, tumours characterized by low TIL 
levels might remain intrinsically invisible to 
the immune system even with this treatment 
because they do not express enough T-cell 
receptors. Denkert now plans to investigate 
that hypothesis in an upcoming clinical trial 
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sponsored by the German Breast Group, a 
network of the country’s academic research 
institutions. 

Heery, however, cautions that not all TILs are 
equal. “Characterizing them is just as important 
as counting them, he says. TILs could reflect 
type 1 immunity or type 2, which can suppress 
anti-tumour responses, Heery says. That is a 
crucial distinction, assuming that, as some 
research suggests, there tends to be a dispro- 
portionate number of type 2 TILs in breast can- 
cer. Denkert agrees, but adds that approaches 
to discern type 1 and type 2 lymphocytes in 
tumour samples are in development. “Immu- 
nologists will say ‘the immune system is com- 
plicated and we have to look at the different 
types of immune cells? which is completely 
true,” Denkert says, “but we pathologists see 
this type of characterization as a second step 
that follows an initial effort to quantify the 
overall number of immune cells in the tumour. 
If we combine both worlds and accept different 
approaches, we can generate a more complete 
picture” 

As is the case with other experimental 
treatments, immunotherapies are being 
tested mainly in advanced, metastatic breast 
cancer. “Drug development is always done 
in the metastatic setting first to try to find 
the agents that work rapidly,’ Heery says. 
The problem with cancer immunotherapies 
— especially vaccines — is that they can take 
up to three months to build up an adequate 
response, “so if sceptics don't think they’re 
working well enough, it could be that we're 
just testing them in the wrong setting.” 

Ultimately, immunotherapies may have 
more success when used to treat early-stage 
breast cancer, and McArthur is one of the 
few investigators working in that setting. She 
selectively breaks tumour sections into frag- 
ments by freezing them with a tool that looks 
like a biopsy needle — the tiny tumour frag- 
ments are thought to attract a more robust 
immune response to cancers that are not 
highly immunogenic to begin with, she says. 
She is now testing this cryoablation approach 
in combination with ipilimumab in women 
with newly diagnosed oestrogen-positive 
cancer, independent of their HER2 status. 

“This is an incredibly exciting time to be in 
oncology,’ she says. “We're seeing remarkable 
advances with immunotherapy in other solid 
tumours like melanoma and lung cancer, and 
I’m enthusiastic we'll have success in breast 
cancer too. We're on the cusp of a new era.” m 


Charles Schmidt is a freelance science writer 
in Portland, Maine. 
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Big hopes for big data 


Technology is allowing researchers to generate vast amounts of information about tumours. 
The next step is to use this genomic data to transform patient care. 


BY JILL U. ADAMS 


drian Lee has dedicated his career to 
Asin breast cancer, which is to say 

he is actually tackling many different 
diseases at once. “No two breast cancers are the 
same,’ says Lee, a pharmacologist and chemi- 
cal biologist at the University of Pittsburgh in 
Pennsylvania. “Cancer is way more complex 
than we know.” 

Lee is using genomic technology to fully 
describe cancers of the breast and apply that 
knowledge to guide treatment decisions for 
individual patients. “We can now analyse mul- 
tiple variables from a single specimen, such as 
changes in DNA, changes in RNA and changes 
in methylation,” he says. “Genome-wide scans 
allow for better systems biology and allow us to 
learn what's gone wrong in a particular tumour’ 

Sequencing tumours is faster, cheaper and 
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easier than ever. With many researchers collect- 
ing sequence data and uploading these to public 
databases such as the The Cancer Genome Atlas 
(TCGA), opportunities to describe the many 
different cancers that arise in breast tissue are 
upon us. “The challenge used to be generating 
the data,” says Nicholas Navin, a geneticist at 
The University of Texas MD Anderson Cancer 
Center in Houston. “Those issues have been 
resolved. Now the challenge is data processing 
and data analysing — interpreting the muta- 
tions and communicating those to oncologists.” 

At the University of Pittsburgh, researchers 
are working to link the molecular signatures 
of people with breast cancer to a host of clini- 
cal data, including demographic information 
associated with risk such as age, ethnicity and 
body weight. They are mining electronic health 
records for clinical correlates, treatment interac- 
tions and outcomes. “We've got a big haystack 
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and we're trying to find the needle,’ says Lee. 
“But we're also trying to incriminate the needle, 
by linking it to lots of things.” Collecting all that 
data from patients’ electronic records adds up, 
Lee says. It takes infrastructure — Pittsburgh 
has already accumulated 5 petabytes, or 5 mil- 
lion gigabytes, which is enough data to overload 
around 40,000 new iPhone 6 devices. 

Making the connection between the reams 
of data coming out of sequencing laboratories 
and the individual women fighting breast can- 
cer takes big-time computing power. Big data 
needs researchers who are comfortable with 
statistical noise and those who are old hands at 
the iterative process required to create flexible 
computer programs. 


FROM DATA TO KNOWLEDGE 
Big-data researchers take a large data set 
and look for patterns. The idea is to identify 
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mutations that can be targeted with drug 
treatment. It is the essence of personalized 
medicine: screen a patient’s tumour for a set 
of biomarkers to choose the best treatment to 
fight the cancer. Big-data researchers believe 
that analysing the data of the thousands of 
tumours that have come before will reveal 
patterns that can improve screening and diag- 
nosis, and inform treatment. 

Lee and his colleagues have illustrated how 
big-data science led to a rethink of breast can- 
cer’. They used two public databases — TCGA 
and METABRIC (Molecular Taxonomy of 
Breast Cancer International Consortium), 
which contain data on the entire set of genes, 
RNA transcripts and proteins of thousands of 
breast-cancer tumours — to parse out poten- 
tial differences in the molecular signatures of 
breast tumours in younger compared with 
older women. Women who are diagnosed 
before the age of 40 tend to have worse dis- 
ease: they are more likely to have later-stage 
cancers, poorer prognoses and worse survival 
outcomes than older women. 

The team analysed tumour data from women 
under 45 years old, who were probably pre- 
menopausal, and women over 55 years old, who 
were probably postmenopausal. “We looked at 
everything you can lookat;’ Lee says, including 
mutations in the genome, mutations in RNA, 
tumour gene expression, variations in the num- 
ber of copies of certain genes and levels of DNA 
methylation. They found that tumours in pre- 
menopausal women follow a different playbook, 
especially in terms of gene expression. 

As researchers find rarer and rarer mutations, 
the question of significance becomes more and 
more daunting, Lee says. He has just finished 
looking at a spreadsheet of 2,000 mutations. 
“One of them is the ER mutation, he says, refer- 
ring toa mutation in the oestrogen receptor — a 
common mutation in breast cancers. “But how 
do I sift through the others? That’s the funda- 
mental problem.” 

One way to do it is to analyse the cellular 
pathways that the mutations affect. That means 
using algorithms developed to integrate all the 
collected molecular information and catego- 
rizing it into the common growth or cell-cycle 
pathways. Researchers can use this sorted 
information to describe tumours in terms of 
affected pathways rather than simply affected 
molecules. In one such effort, bioinformatician 
Josh Stuart of the University of California Santa 
Cruz developed a computational method that 
integrates a variety of genomic data sets with 
known cell-signalling pathways. “We know how 
gene circuits work in normal cells. Now we're 
asking, what got broken in this tumour cell?” 
Stuart says. “It’s surprisingly successful.” 

Lee's group used the computational analy- 
sis PARADIGM in their study’. The approach 
proved particularly revealing for oestrogen- 
receptor-positive breast cancers in premeno- 
pausal women. The method demonstrated that 
although the individual molecules that showed 


abnormalities varied, they often occurred 
within a particular set of pathways that signal 
for integrins — proteins involved in the forma- 
tion of tumour-associated blood vessels. 

The evident importance of integrins in 
the tumours of premenopausal women with 
oestrogen-receptor-positive breast cancers 
suggests that these molecules could bea thera- 
peutic target. “There are integrin inhibitors out 
there; Lee says, and some of them have been 
tested in clinical trials. 


FROM KNOWLEDGE TO APPLICATION 

As big-data researchers churn through large 
tumour databases looking for patterns of muta- 
tions, they are adding new categories of breast 
cancer. In 2012, two consortiums published 
papers on their data-driven approaches to 
breast-cancer genomics. The TCGA Network, 
made up of dozens of research institutions in the 
United States and Europe, came up with four 
overarching groupings of breast tumours based 
on genetic and epigenetic abnormalities”. They 
found that only three genes (T'P53, PIK3CA 
and GATA3) were mutated in more than 10% 
of the samples, dem- 
onstrating that rare 
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grated genetic data 

— copy-number and gene-expression changes 
— with long-term clinical outcomes into 
10 families of tumour types. Combined with 
clinical data, both these new groupings have 
the potential to allow oncologists to make bet- 
ter prognoses and treatment decisions’. 

“We're still refining our approach,’ says Oscar 
Rueda, a biostatistician at Cancer Research 
UK’s Cambridge Research Institute, which is 
part of the METABRIC effort. They are now 
fully sequencing the 2,000 samples used in the 
research. Rueda says that the hope is to identify 
driver mutations, which havea role in the initia- 
tion of cancer. “There are a hundred different 
mechanisms by which cells go bad,” he says. 

Big-data approaches may eventually reveal 
cellular pathways that had previously been 
overlooked. Avi Maayan of the Icahn School of 
Medicine at Mount Sinai is working on a path- 
way database to create a resource for future 
potential targets. His effort comes under the 
umbrella of the National Institutes of Health 
Library of Integrated Network-based Cellular 
Signatures (LINCS), which uses data generated 
at institutions such as the Broad Institute of 
Massachusetts Institute of Technology. High- 
throughput labs at the Broad Institute test a 
host of drugs — both experimental ones as 
well as those with regulatory approval — on 
ten different cell lines to study how the drugs 
interact with cellular activity. 
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“You get a signature of what happens to 
cells,” Maayan says. “And signatures can be 
queried for new uses of drugs.” If clinical 
researchers want to turn off a particular cellu- 
lar pathway in cancer, they could use Maayan's 
database to search for drugs that have that 
action. 


CLINICAL TRANSLATION 
The next step is to apply the newly gained 
knowledge of actionable mutations to patient 
care. Research hospitals collect data on 
patients for their own care and to add to the 
knowledge base. At MD Anderson Cancer 
Center, for instance, people with a new cancer 
diagnosis are screened for a selection of cancer 
genes. “It’s not the whole genome, but a panel 
of 200 genes with actionable mutations,’ Navin 
says. As research knowledge grows, so does the 
panel. In the past year, the original 200 genes 
have already expanded to 300, he says. 
Navin’s speciality is single-cell sequencing, 
which allows his lab to study tumour cells 
that are circulating in the blood. One might 
only collect 10 or 20 cells in a sample. “Previ- 
ous analytic methods couldn't process such a 
small number of cells,” he says. The single-cell 
approach opens the possibility that patients 
could be monitored over the course of treat- 
ment with a noninvasive test, such as a blood 
sample. Oncologists could then check if the 
tumour cells are responding to therapy or if 
resistance is emerging. 

Big data intersects with the clinic in the 
form of I-SPY 2, a clinical trial of experimen- 
tal breast-cancer drugs. “We're collecting real 
time data on patients,’ says Laura van't Veer, a 
molecular oncologist at the University of Cali- 
fornia San Francisco. 

Patients are enrolled at diagnosis and, based 
on their tumour signature, placed into one of 
eight pre-defined types. The women are then 
treated with standard treatment and an experi- 
mental targeted drug, while van't Veer and her 
colleagues monitor which tumours respond to 
which targeted therapies. The goal is to evalu- 
ate biomarkers that improve response to tar- 
geted therapies. “With standard chemotherapy, 
we see 30-35% complete remission,’ says van't 
Veer. “Among our 8 subtypes, we sometimes 
get up to 50-60% remission.” 

Plenty of challenges lie ahead. A single 
tumour can host a baffling diversity of muta- 
tions, which change over time. Still, Ma’ayan 
remains the optimist. “The more money and 
effort we can throw at the problem, the more 
snapshots we can get. With better resolu- 
tion, we can improve our understanding of 
the whole process,” he says. “It’s not infinite. 
Although it can feel like it” m 


Jill U. Adams is a freelance science writer in 
Albany, New York. 
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Herceptin binds to a receptor called HER2 that is produced in excess by some breast cancer cells (pink). 


Eyes on the target 


A push to expand the success of a pair of antibody- based 
drugs is buying some women years of freedom from 


breast cancer. 


BY MICHAEL EISENSTEIN 


escriptions of a drug as revolu- 
D tionary, transformative or a home 

run are usually reserved for press 
releases or presentations to investors. But 
oncologists are embracing such language to 
describe two drugs that allow them to offer 
some people with breast cancer a cure rather 
than a consolation. 

The drugs are trastuzumab (Herceptin) 
and pertuzumab (Perjeta). Both are anti- 
body-based agents that target the signalling 
protein HER2, which is produced in abun- 
dance in 20-25% of breast tumours. The 
high levels result in poorly controlled cell 
growth and proliferation. For decades, the 


protein has been regarded as the hallmark 
of a dire prognosis, says oncologist Luca 
Gianni, at the San Raffaele Hospital Scien- 
tific Institute in Milan, Italy. 

Today, many patients with HER2-positive 
tumours are essentially having their cancer 
eradicated by receiving a double-hit of tar- 
geted therapy before surgery. Even patients 
diagnosed with late-stage, metastatic disease 
— once seen as an imminent death sentence 
— are living much longer than ever before. 
In a few exceptional 
cases, the duration of 
these benefits can be 
remarkable. “We never 
use the ‘c-word’ with 
metastatic disease, but 
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I have one patient in my practice who has 
been in complete remission for 13 years,’ 
says Shanu Modi, a medical oncologist at 
New York’s Memorial Sloan Kettering Can- 
cer Center. “I think of her as a cured person 
who just comes by my clinic to visit every 
three months.” 

These two agents exemplify the mod- 
ern model of targeted therapy in oncology 
— give patients personalized treatments 
that selectively hit tumours based on their 
specific set of mutations, rather than con- 
ventional chemotherapy, which is broadly 
toxic to healthy as well as dangerous cells. 
Nevertheless, cancers will return in many 
people who receive prompt treatment. “T still 
don't think we’re curing the vast majority of 
patients,” says Modi. New agents in the clini- 
cal pipeline could improve the effectiveness 
of these targeted agents and help doctors and 
patients to achieve more and longer-lasting 
victories, although some worry that soaring 
costs (see ‘Crippling costs’) will limit the 
reach of these next-generation therapeutics. 


ADYNAMIC DUO 

Trastuzumab was the first HER2-targeting 
drug to reach the market, following a 
phase 3 clinical trial that showed that the 
drug improved the odds of survival by 20% 
in women with metastatic HER2-positive 
breast cancer’. Subsequent data showed 
that giving people the drug after the surgi- 
cal removal of early-stage tumours cut the 
risk of the cancer returning in half”. “That's 
an absolute home run,” says Elizabeth Mit- 
tendorf, a surgical oncologist at Houston’s 
MD Anderson Cancer Center. “We don't see 
numbers like that often in oncology.’ 

In 2012, data from the CLEOPATRA 
clinical trial showed that oncologists could 
expect an even better return by pairing tras- 
tuzumab with pertuzumab”*. Both drugs are 
antibodies that specifically bind HER2, but 
each recognizes a different site on the pro- 
tein, and their combined effects (along with 
conventional chemotherapy) resulted in 
even greater tumour shrinkage and further 
improvements in prognosis for people with 
metastatic breast cancer. The combination 
prolonged median survival by well over a 
year relative to trastuzumab alone. “We 
give that combination to all of our meta- 
static patients as first-line treatment if we 
can,” says Modi, “and they’re being treated 
for three or four years on average.” In two 
further trials**, by using the trastuzumab- 
pertuzumab combination to shrink tumours 
before surgery — an approach known as 
neoadjuvant therapy — clinicians discovered 
that they could eradicate all traces of cancer 
from the breast and lymph nodes (known 
as a pathological complete response) in 
roughly half of patients. 

That is not the same as a cure, however, 
and these trials are too recent to confirm 
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how durable the benefits are. Still, physi- 
cians are already seeing clear benefits in 
other domains. Mittendorf says she can 
now perform less-invasive operations that 
leave the breast largely intact, and neoad- 
juvant treatment can eliminate lymph node 
growths that previously required surgical 
removal. Oncologists are now eagerly await- 
ing the results from the recently concluded 
APHINITY trial, which measured survival 
and recurrence in patients given the com- 
bined therapy after surgery. 

Other HER2-targeted therapies have reached 
the clinic, but without offering such a clear 
patient benefit. Lapatinib, for example, delays 
tumour progression by interfering with HER2 
signalling, but also exhibits more toxicity and 
is generally less effective as a first line of treat- 
ment than its antibody-based counterparts. 
As a result, lapatinib is generally reserved for 
late-stage treatment of patients whose tumours 
acquire resistance to trastuzumab or pertu- 
zumab. Importantly, this treatment may also 
be effective at limiting metastatic growth in 
the brain — a particular threat to patients who 
are HER2-positive (see ‘Staving off a deadly 
invasion). 

Another HER2-targeted drug called 
T-DM1 (Kadcyla) mitigates the severe tox- 
icity associated with conventional chemo- 
therapy agents by physically tethering the 
chemotherapeutic agent DM1 to trastu- 
zumab. This restricts the toxic effect to 
HER2-expressing cells. “There’s no hair 
loss and very little neuropathy,’ says Modi. 
“I think patients are enjoying a much bet- 
ter quality of life on T-DM1.’ As one of the 
first antibody-drug conjugates to reach 
the clinic, T-DM1 extends median survival 
by more than five months in patients with 
recurrent HER2-positive breast cancer rela- 
tive to lapatinib. 

A recently concluded trial called MARI- 
ANNE examined whether T-DM1 could 
replace trastuzumab and chemotherapy as 
the first line of treatment. Although less 
toxic than the standard drug combination, 
T-DM1 proved no more effective at delay- 
ing disease progres- 
sion, and so will 


probably remain a Shere ona 
second-line option. hair pod and 
Gianni hopes that very Ultte 
these results will not neurop athy 1 
prevent clinicians think patients 
from finding smart re emoying 
ways to incorporate a much better 
this safe and gener- quality of life on 
T-DMI1.” 


ally effective drug 
into their regimens. 
“T-DM1 was presented as the solution 
to all of our problems, and now it’s being 
demonized as an expensive failure — but 
this is not so,” he says. “It is simply a drug 
that requires some better thinking and a dif- 
ferent approach.” 
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TARGETED THERAPY 


Crippling costs 


The life-extending advances in targeted 
therapy that are causing so much 
excitement for clinicians and patients with 
breast cancer are not cheap. In the United 
Kingdom, a course of T-DM1, for example, 
costs £90,000 (US$138,000). The country’s 
National Institute for Health and Care 
Excellence rejected routine coverage of the 
drug through the National Health Service 
(NHS) because of the high cost. For now, 
this drug is available through a special fund 
that provides access to medications that 
are not generally available through the NHS, 
although this fund is due to end next spring. 
As costs soar, even the best drugs 
risk becoming bogged down in 
protracted negotiations. Consider the 
antibody-based agent pertuzumab. 
It was approved for presurgical, or 
‘neoadjuvant’ use, by the European 
Medicines Agency in June, but cannot 
be used in this way until the various 
national health systems agree on a price. 
“Until recently, everything that had clear 


CUTTING OFF THE EXITS 

The ability to target HER2 has been a life 
saver for many patients, but it is far from a 
complete victory. Many people who receive 
neoadjuvant treatment will not have their 
cancer eradicated. And although oncolo- 
gists can keep cancer in people with meta- 
static disease at bay for years, this requires 
multiple rounds of therapeutic attack with 
a shifting arsenal of drugs. “The chance of 
having a durable response is much higher, 
and we're seeing patients with metastatic 
disease go through five, six or seven lines of 
treatment,’ says Nancy Lin, an oncologist at 
the Dana-Farber Cancer Institute in Boston, 
Massachusetts. “But the disease is generally 
not curable.” 

The mechanisms by which HER2-positive 
tumours acquire resistance to the drugs that 
once laid them low are poorly understood. 
Gianni sees the tumour environment, which 
contains a diverse mixture of cells with 
distinct mutational profiles, as part of the 
problem. “If 30% of the cells in a tumour 
overexpress HER2, that’s a HER2-positive 
tumour,’ he says, “but many cells still do not 
express that target.” Thus, killing off trastu- 
zumab-vulnerable cells will still leave a large 
cancerous community. This highlights the 
value of generalized chemotherapy, and 
Francisco Esteva at New York University’s 
Langone Medical Center suggests that this 
phenomenon may also be to blame for 
T-DM1’s modest performance in MARI- 
ANNE. “If you target too much, the other 


benefits was approved and reimbursed,” 
says oncologist Thomas Bachelot at the 
Centre Léon Bérard in Lyon, France. “But 
costs are really getting very high very 
rapidly, and there will be some tough 
decisions to make.” 

In the United States, private insurance 
generally covers these pricey treatments, 
but policies don’t always cover the 
full costs, leaving many patients with 
crippling bills. In August, more than 
100 clinicians signed a letter in Mayo 
Clinic Proceedings® to support a patient 
petition for vigorous pricing reforms, 
noting that every oncology drug approved 
by the Food and Drug Administration 
in 2014 cost upward of US$120,000 
per year — and that patients routinely 
shoulder up to a quarter of that cost. “We 
are negating therapeutic opportunities,” 
says oncologist Luca Gianni at the San 
Raffaele Hospital Scientific Institute in 
Milan, Italy, “and this should not happen.” 
ME. 


clones can escape,’ he says. 

However, Thomas Bachelot, a medi- 
cal oncologist at the Centre Léon Bérard, 
in Lyon, France, does not believe that this 
is the only mechanism by which tumours 
can recur after an initial therapeutic vic- 
tory. “I do a lot of biopsies, and they always 
remain HER2-positive — they don’t lose 
it? he says, adding that even if the tumour 
rebounds while patients are taking trastu- 
zumab or pertuzumab, they still draw some 
benefit from those drugs. If HER2-targeted 
therapies are halted, he says, patients tend to 
fare even worse than if they had stayed the 
course. This outcome hints at other cellular 
pathways and processes that amplify or miti- 
gate the effects of HER2-targeted treatment. 
These pathways might therefore serve as 
useful biomarkers to guide therapy. The trial 
data point to sub-populations with strong 
responses in both directions — one neoadju- 
vant trial* found that nearly 17% of patients 
had a pathological complete response from 
targeted drugs without any chemotherapy, a 
result that suggests that some patients could 
skip the most toxic components of treat- 
ment. Other tumours remain stubbornly 
unresponsive. “About 10% of our HER2- 
positive patients do not respond at all to 
trastuzumab and pertuzumab,’ says Bach- 
elot. “There is this huge primary resistance, 
and we don’t know why.” 

Another apparent pathway to resistance 
arises from hyperactivation of a signalling 
cascade known as the phosphoinositide 
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3-kinase (PI3K) pathway. Preliminary clini- 
cal studies suggested that everolimus, a drug 
that interferes with PI3K signalling, might 
increase the effectiveness of trastuzumab 
against metastatic breast cancer. Although 
results from two phase 3 trials proved dis- 
appointing, subsequent examination of the 
data revealed that everolimus may delay 
progression in patients with mutations that 
affect PI3K activity’, However, Modi believes 
that future studies should hit this pathway 
through alternate means. “You can get a few 
weeks improvement from everolimus, but 
with a lot of toxicity,” she says. “There have 
got to be better drugs for targeting PI3K.” 

Her team is evaluating one such drug, 
BYL719, and has also seen promising 
results from another approach to bolster- 
ing HER2-targeted treatment. Tumour 
cells rely on a molecule called heat-shock 
protein 90 (HSP90) to manage the produc- 
tion of HER2, and Modi and colleagues have 
found that chemical inhibition of HSP90 
can stall HER2-dependent tumour growth’. 
“In our first phase 2 trial, we combined an 
HSP90 inhibitor with Herceptin and saw a 
nearly 25% response rate from just these two 
agents, without any chemo,’ she says. Her 
team is working with two different HSP90 
inhibitors, including a compound that can 
be isotopically labelled so that the extent to 
which tumours are taking up the drug can be 
directly monitored. 

Most breast tumours also show excessive 
activity by the receptors that respond to the 
reproductive hormones oestrogen and pro- 
gesterone, and numerous studies suggest 
that these hormone receptors influence the 
response to HER2- 


targeted drugs. Data “Wye now have 
from dual-therapy plenty of drugs 
neoadjuvant tri- that exploit 

als indicate that the different 
pathological com- mechanisms 
plete response rate tion. the 
jumps from around of ac ae 
50% to more than challenge is to 
80% in patients with use them side the 
hormone receptor- optimal way. 


negative tumours’. 

A growing body of data demonstrate that 
these two signalling pathways collaborate to 
promote growth’, suggesting that multiple 
hits may be necessary to limit the tumoutr’s 
escape routes. 

Ultimately, combinations of highly targeted 
treatments with more broadly active therapy 
regimens may hold the key to ensuring that 
no patient with cancer gets left behind, Gianni 
says. “We now have plenty of drugs that exploit 
different mechanisms of action,” he says. “The 
challenge is to use them in the optimal way.’ For 
now, these targeted therapies are giving some 
patients the opportunity to live cancer-free, and 
giving many others the prospect of additional 
years of life — and a chance to think differently 


METASTATIC TUMOURS 


Staving off a deadly invasion 


HER2-positive breast cancer can spread to the brain (green cross in left image); some patients 
have been shown to respond to targeted therapy to reduce the metastasis (right). 


Doctors and patients who wage a 
successful first-round bout with an HER2- 
positive breast tumour may subsequently 
find themselves grappling with a more 
fearsome opponent. “About half of 

those patients who have HER2-positive 
metastatic tumours develop cancer in 

the brain,” says oncologist Nancy Lin 

of the Dana-Farber Cancer Institute in 
Boston, Massachusetts. Part of the cause 
is biological, and Lin believes “there is 
probably some inherent predilection 

for these tumours to go to the brain”. 
However, the remarkable success of 
targeted treatments is itself a factor. The 
prolonged life of patients with metastatic 
breast cancer simply creates more 
opportunities for brain growths to form. 

It remains unclear whether trastuzumab 
and pertuzumab are effective at hitting 
these metastases. Some researchers 
believe that the capillaries within the brain 
have too tight a seal for these relatively 
large drugs to penetrate. However, 
oncologist Thomas Bachelot of the Centre 
Léon Bérard in Lyon, France, thinks that 
this barrier is sufficiently leaky within 
tumours to grant these drugs access, 
and points to trial data suggesting that 
patients with stable brain metastases!° 


about their futures. “It's completely changed the 
conversation in my office,” Mittendorf says. m 


Michael Eisenstein is a freelance science 
writer living in Philadelphia. 
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respond well to targeted treatment. 
Bachelot and colleagues have also found’? 
that the small-molecule drug lapatinib, in 
combination with chemotherapy, controls 
brain metastases well. “You get very good 
results in terms of time to progression that 
are equivalent to what you see with disease 
outside the brain,” he says. 

Radiotherapy approaches have also 
become more sophisticated; rather than 
subjecting patients to highly damaging 
whole-brain irradiation, clinicians can 
now selectively hit lesions while sparing 
healthy nervous tissue. People who once 
faced six-month life expectancies can 
survive five years or longer. 

For years, the presence of disease 
in the brain automatically disqualified 
patients from being involved in trials. 

“It was often just part of the boilerplate 
language for eligibility,” says Lin, resulting 
in missed opportunities to identify 
regimens that might control or kill such 
growths. These restrictions are now 
being removed. Some companies are 
considering opening up trials to patients 
with untreated or progressive brain 
metastases, she says, a move that could 
quickly widen the battle against invasive 
breast cancer. M.E. 
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Some of the operations performed by breast-cancer surgeon Shelley Hwang may no longer be needed if better biomarkers for breast cancer are found. 


MOLECULAR BIOLOGY 


Marked progress 


Reliable markers could eliminate surgery and radiation therapy for many women diagnosed 
witha type of cancer that often does not progress beyond its non-invasive form. 


BY HANNAH HOAG 


Tien years ago, Mary Jane Lapinski had 
a routine breast-cancer screening mam- 
mogram at her local hospital in Balti- 
more, Maryland. The mammogram showed 
multiple specks in her left breast. Her physician 
called it ductal carcinoma in situ (DCIS) — an 
early-stage, non-invasive cancer of the milk 
ducts. A surgeon told her he could attempt a 
lumpectomy to remove the lesions, but he rec- 
ommended a mastectomy — removal of the 
entire breast. “I kept thinking, this isn’t logi- 
cal? says Lapinski, who was 48 at the time. “It 
was mind-boggling that a non-invasive cancer 
carried the same or more aggressive treatment 
than an invasive cancer.” 

The rogue cells in Lapinski’s breast occupied 
a diagnostic grey area. Some cases of DCIS 
advance to invasive breast cancer, metastasis 


and death, but most do not. By current esti- 
mates, 20-30% of DCIS tumours will become 
aggressive within 20 years, says Shelley Hwang, 
a breast-cancer surgeon and researcher at Duke 
University School of Medicine in Durham, 
North Carolina. Still, most oncologists feel that 
it is best to remove the lesions and offer radia- 
tion treatment to stave off their progression. 
The trouble is that oncologists cannot tell 
for certain which DCIS lesions will remain idle 
and which will turn deadly. Identifying breast- 
cancer biomarkers — molecules that can iden- 
tify the pre-cancerous cells that are likely to 
progress to invasive cancer — could lead to 
better-informed decisions about treatment. 
Unfortunately, little is known about the natu- 
ral history of DCIS. It is difficult to track the 
course of the disease because so many women 
undergo surgery. “If we can identify a subset 
of patients that are at risk of developing an 
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invasive cancer and only treat those, we would 
spare many women unnecessary treatment,” 
says Eileen Rakovitch, a radiation oncolo- 
gist at Sunnybrook Health Sciences Center in 
Toronto, Canada. 


SURGING DIAGNOSIS 

Before the introduction of widespread screen- 
ing mammography in the 1980s, DCIS lesions 
represented about 3% of breast cancers in the 
United States. They now account for nearly 
one-third of newly diagnosed breast cancers’. 
But detecting DCIS does not necessarily add 
much information about a womans future or 
overall health. “It is entirely possible to find 
cancers that don't matter,’ says H. Gilbert 
Welch, an internist and cancer epidemiolo- 
gist at the Geisel School of Medicine at Dart- 
mouth in Hanover, New Hampshire. Welch 
and Archie Bleyer, then at the University of 
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Texas Medical School in Houston, 
estimated that, in 2008, 70,000 US 
women received an early-stage 
breast-cancer diagnosis for lesions 
that would not have led to clinical 
symptoms, accounting for 31% of 
screening-detected breast cancers”. 

Most women diagnosed with 
DCIS have a lumpectomy or mas- 
tectomy — ora double mastectomy 
— along with radiation therapy. But 
the benefits of such treatments are 
hard to find. A much-discussed 
observational study of more than 
100,000 US women with DCIS 
found that women who had 
lumpectomies or mastectomies to 
treat DCIS had just a 3.3% chance 
of dying of breast cancer in the next 
20 years, not much different than 
the risk to women in the general 
population’ (2.7%, according to 
the American Cancer Society). 

Ideally, women with DCIS would be tested 
to assess whether surgery is the best course of 
action. Although no such test is clinically avail- 
able, physicians are starting to use biomarkers 
to predict the future of women who have already 
had surgery. One test — Oncotype DX DCIS 
Score, produced by Genomic Health in Red- 
wood City, California — stratifies women who 
have had breast-conserving surgery for DCIS 
into low, medium and high risk for future can- 
cer. The test evaluates the expression of seven 
cancer genes (including those associated with 
cell proliferation and hormone receptors) in tis- 
sue samples taken from breast biopsies. 

Rakovitch validated DCIS Score in a retro- 
spective study of women diagnosed with DCIS 
and whod had breast conserving surgery. In 
work funded by a research grant from Genomic 
Health, she and her colleagues applied the test 
to tissue samples from 718 women*. “Women 
with an intermediate- or high-risk score had 
twice the risk of developing local recurrence 
compared to women with a low-risk score,” says 
Rakovitch. She says that the assay can pick out 
some women who are at a high risk of recur- 
rence, but whom doctors might have consid- 
ered to be low risk based on patient history and 
tumour characteristics. 


SEARCHING FOR SIGNPOSTS 

The search for a reliable measure to prevent sur- 
gery in the first place goes on. Any test, Hwang 
says, would probably involve a large array of 
markers that could be combined to form a 
cohesive picture. “We've taken the individual 
biomarkers as far as they can go and they're not 
giving us the answers we need,’ she says. 

Thea Tlsty, a molecular pathologist at the 
University of California, San Francisco, and 
her colleagues have identified three proteins 
involved in cell proliferation that are associ- 
ated with future aggressive breast cancers’. 
Tlsty’s team found that of 1,162 women who 
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Ductal carcinoma in situ is a common type of non-invasive breast cancer. 


had.a lumpectomy for DCIS, those whose tissue 
was positive for all three biomarkers — COX-2, 
p16 and ki67 — had a 20% risk of developing 
an invasive cancer within 8 years. If they had 
none of the proteins, their risk dropped to 4%. 
“These markers indicate which pre-cancers 
are the baby, basal-like cancers, which are the 
most lethal and metastatic,” she says. In unpub- 
lished research, Tlsty’s group has subsequently 
identified several other potential biomarkers in 
proteins that coordinate cell death. Four pro- 
spective studies in Australia, the United States 
and the United Kingdom are further evaluating 
the trio of markers, Tlsty says. 

Other biomarkers have also shown prom- 
ise. Invasive breast cancers often stop the 
expression of tumour suppressor genes. One 
of those genes, called SYK, seems to be part of 
a genetic hub that determines which precan- 
cerous cells eventually metastasize. One study 
found that women who had altered expres- 
sion of 55 genes that interact with SYK had 
reduced survival’. 

The search for circulating markers for early 
detection has proved frustrating at times, says 
Jeffrey Marks, a cancer cell biologist at Duke 
University. Marks and his colleagues selected 
90 blood-based biomarkers, but none were 
useful in distinguishing cases of breast cancer 
from benign controls’. “They're very difficult 
to validate in independent populations,’ he says. 

Some researchers are looking for signals that 
might reveal which DCIS lesions are associated 
with an increased risk of developing future 
invasive breast cancer. Andy Beck, a compu- 
tational biologist at Harvard Medical School in 
Boston, Massachusetts, and his team are exam- 
ining patterns of genomic alteration. Using 
data from DNA profiles of invasive breast 
cancers catalogued with The Cancer Genome 
Atlas, the group identified genomic locations 
that are most frequently copied or deleted in 
invasive breast cancer lesions. “We're basically 
saying that if it’s not present in invasive cancer 
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then it’s not likely to be useful,” 
says Beck. 

In this case, the marker proved 
to be grimly robust. In a study of 
271 patients, women with lesions 
that had extra copies in all three 
regions had a 17-fold higher risk 
of having a coincident invasive 
breast cancer compared with those 
women who had none’. The group 
is expanding the study to include 
about 20 chromosomal regions 
commonly altered in invasive 
breast cancer. In collaboration with 
Stanford University, Washington 
University and the Nurse's Health 
Study, the group is launching a 
study of 1,400 patients to predict 
the risk of recurrence or a subse- 
quent invasive cancer over time. 

Researchers have recognized 
that sheer genetic diversity within 
precancerous tissues may help to predict can- 
cer formation and progression. As precancer- 
ous cells evolve and accumulate genetic and 
epigenetic alterations, they become more var- 
ied. Some studies have shown that diversity can 
predict progression. Marks is now studying 
the genetic diversity of the cells within DCIS 
lesions. In theory, if the DCIS has a more com- 
plex mosaic of cells, there is a stronger likeli- 
hood that one of them will develop into a more 
‘fit’ cancer cell that can invade the surrounding 
tissue and metastasize. 

Until scientists have a fuller understanding 
of which markers indicate an increased risk of 
developing invasive cancer, patients with DCIS 
will lack clarity about their future. For her part, 
Lapinski never went under the knife. Instead, 
she tracked down Hwang, who suggested that 
Lapinski join a three-month clinical study of the 
oestrogen-blocking drug tamoxifen. Lipinski 
was supposed to have surgery at the end of the 
trial, but she opted to forego the operation and 
continue with the tamoxifen, and, later, ralox- 
ifene. She checks in with Hwang twice a year for 
an examination and a mammogram. Although 
others see uncertainty in Lapinski’s choice, she 
doesn't see it as a risky move. “Everybody has 
to make their own decision,’ she says. “It has to 
be comfortable for them,” m 


Ad 


Hannah Hoag is a freelance science writer in 
Toronto. 
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Relative risk 


Mutations in BRCA genes predispose women to cancer, but 
outside influences shape the ultimate risk. 


BY MOISES VELASQUEZ-MANOFF 


transformed how we think about cancer 
with a single discovery: a mutation that 
dramatically increased carriers’ risk of ovar- 
ian and breast cancer’. The gene BRCA 1 codes 
for a protein that is important in DNA repair. 
The mutated version impairs defences against 
tumours, increasing the lifetime risk of breast 
cancer in King’s cohort to more than 80%, and 
the risk of ovarian cancer to as high as 40-65%. 
By comparison, the risk in the general popula- 
tion is 12% for breast cancer and 1.3% for ovar- 
jan cancer. 
Four years later, another group identified a 
mutation in a second gene — BRCA2 — that 


I n 1990, geneticist Mary-Claire King forever 


also elevated the risk of these cancers, although 
by less. Mutations in the two BRCA genes are 
now thought to account for between 5 and 
10% of all breast cancers, and 15% of ovarian 
cancers. These discoveries stand as landmark 
successes of the genomic era. 

Although testing women for BRCA muta- 
tions is now commonplace for women with a 
family history of the disease (see ‘Should all 
women be tested?’), the path between muta- 
tion and cancer is complex. Some studies 
show that the risk from 


BRCA mutations varies ODNATURE.COM 
among different popu- _ For more on cancer 
lations, suggesting that __ predisposition 
any particular woman's __ genes: 


fate depends on more _ go.nature.com/xcxnlw 
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than just her genes”. Among women who carry 
the mutation, additional factors — including 
exposure to oestrogen — may shape the risk of 
disease. Understanding the interplay between 
genes and the environment could illuminate 
the ultimate origins of breast cancer, possibly 
leading the way to new strategies for preven- 
tion and treatment. 


UNEQUAL RISKS 

Some of the disparity in the risk from BRCA 
mutations is generational. One repeated find- 
ing is that, by age 50, mutation carriers born 
in the early twentieth century seem to have a 
lower risk of cancer than those born later’. The 
pattern suggests that outside influences interact 
with genes, and that something in the environ- 
ment has changed in an unfavourable way. If 
researchers can figure out what those influences 
are, and why they have increased disease preva- 
lence, maybe in the future they will gain new, 
less invasive tools to delay disease onset — and 
possibly prevent hereditary cancers altogether. 

In 2003, King persuasively showed that the 
link between BRCA mutations and the risk of 
cancer varied with time. For Ashkenazi Jewish 
carriers born after 1940, the likelihood of devel- 
oping breast cancer by age 50 was nearly triple 
that of women born before that date. “These 
people can be in the same family,’ says King, 
who is at the University of Washington, Seat- 
tle. “This is not genetic. The whole risk curve is 
getting shoved younger.’ This ‘cohort effect’ has 
been replicated by numerous researchers over 
the years, but its meaning is debated. 

King attributes the generational shifts in 
BRCA-associated risks primarily to two trends: 
earlier starts to menses, and later first pregnan- 
cies. Women have been delaying first pregnan- 
cies more and more over the course of the past 
century. Meanwhile, girls now have their first 
menstruation about two years earlier than they 
did in the late nineteenth century. 

Together, earlier menarche and later first 
pregnancy have increased the average woman's 
exposure to the sex hormone oestrogen, which 
is thought to promote tumour survival and 
growth. King believes this lengthened period 
of oestrogen exposure increases the risk of 
hereditary and non-hereditary cancers alike. 


WESTERNIZED HORMONES 
Other researchers, however, think it is impor- 
tant to understand how our overall hormo- 
nal milieu may have changed over the past 
100 years or so. Gillian Bentley, an anthro- 
pologist at Durham University in the UK who 
studies Bangladeshi immigrants, thinks that 
society-wide shifts could partly explain the 
increase of cancer during the past century both 
in BRCA mutation carriers and non-carriers. 
One line of evidence is that the reproductive 
hormone levels of Bangladeshi immigrants 
vary according to when the women arrived 
in the United Kingdom. For those who came 
before puberty, adult hormone levels are similar 
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to native-born Britons. But if they arrived after 
puberty, their hormone levels remain sup- 
pressed relative to native Britons, but similar 
to levels of women in Bangladesh. Accordingly, 
South Asian immigrants who arrived as adults 
tend to develop breast cancer less often than 
native Britons. But their British-born children 
have a risk closer to native Britons. “They’re all 
from the same genetic background. We match 
them in terms of region of origin. And they 
move environments, and they look completely 
different,” she says. “What does that say about 
genes?” 

Bentley suspects that childhood infections 
probably hamper the supply of reproductive 
hormone levels in people who grow up in Bang- 
ladesh. Because such infections were common 
throughout all societies in the past, it is possible 
that a similar scenario protected previous gen- 
erations of western women from breast cancer. 
The lesson, however, is not to reinstate early-life 
infections, but to remember that genes inter- 
act significantly with the environment, Bentley 
says. “We need to understand the complexities,” 
she says, “and not be too simplistic in saying 
that genes determine your destiny.” 


IMPROVING THE ODDS 

Joanne Kotsopoulos, a cancer researcher at the 
University of Toronto in Canada, is trying to 
help women who carry a BRCA mutation by 
identifying steps that they can take to protect 
themselves. Overweight women tend to pro- 
duce extra growth factors and sex hormones, 
so staying slim may be one option. In a cohort 
of nearly 1,100 women, Kotsopoulos found that 
BRCA carriers who had 
lost at least 4.5 kilo- 


ee grams between ages 18 
of things you and 30 had around half 
can’t change ; : 
the risk of developing 
about your 
tics.” breast cancer by age 49 
SENees. compared with carriers 
who did not lose weight. 


By contrast, the use of oral contraceptives before 
age 20 correlates with a 45% increased risk of 
cancer in BRCAI carriers by age 40 (but not 
ovarian cancer)’. Because of the protective effect 
of oral contraceptives against ovarian cancer, 
she advises carriers to begin taking them once 
they reach 25. 

What most excites Kotsopoulos, however, is 
exercise. She was first inspired by King’s 2003 
study, which linked exercise in adolescence with 
areduced risk of cancer later. In adults, exercise 
may lower breast-cancer risk by decreasing hor- 
mone and growth-factor levels. Regular physical 
activity in childhood can also delay menarche, 
shortening the period of oestrogen exposure. 
But Kotsopoulos suspects that exercise also 
helps in another way: by directly activating 
BRCA genes. 

Many mutation carriers have one function- 
ing BRCA gene. Exercise activates the functional 
copy sufficiently, Kotsopoulos thinks, to partly 
compensate for the non-functional copy. Inan, 
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MUTATION SCREENING 


Should all women be tested? 


Last year, Mary-Claire King, the geneticist 
who discovered the link between BRCA 
mutations and an increased risk of breast 
and ovarian cancer, and her colleagues 
argued in the Journal of the American 
Medical Association that all women should 
be screened for mutations in BRCA1 and 
BRAC2 (ref. 6). This approach, she says, 
would give women who might not otherwise 
know they are carriers a chance to consider 
proactive steps, including preventive 
surgery. 

Some are sceptical of King’s proposal, 
however. Beverly Levine, a health policy 
expert at Wake Forest University School of 
Medicine in Winston Salem, North Carolina, 
says that the cost of such a programme 
per life saved would be forbiddingly high. 
She is concerned that universal testing 
might capture women with mutations who, 
because of little understood genetic or 


as yet, unpublished study, she found that sed- 
entary mutation carriers have less BRCA gene 
expression than more active carriers. 

These are just associations, she stresses. They 
require confirmation in prospective research, 
and further testing in intervention studies — 
they are not meant to replace preventive sur- 
gery for mutation carriers. Rather, Kotsopoulos 
goal is to provide evidence-based advice to the 
significant number of patients who decline 
surgery. Eventually, however, understanding 
these ‘soft-risk’ modifiers may lead to new, 
less-invasive treatments, including drugs that 
mimic the BRCA-activating effects of exercise 
— an approach Kotsopoulos has begun testing 
in people. 


A CARCINOGENIC MICROBIOME? 

Mysteriously, however, even when researchers 
control for risk modifiers, such as body fat and 
physical activity, they still cannot completely 
abolish the cohort effect. All else being equal, 
older women still seem to have a lower lifetime 
risk of breast cancer than younger women. To 
explain the increase over time of breast cancer 
generally, some researchers are turning their 
attention to the human microbiome. 

Susan Erdman, a microbiologist at Massa- 
chusetts Institute of Technology in Cambridge, 
suspects that diet- and antibiotic-driven shifts 
in our microbial communities have, by encour- 
aging chronic inflammation, increased the risk 
of breast, ovarian and prostate cancer in west- 
ernized populations. 

Erdman has proven the basic concept in 
animal models. In mice, a junk food diet 
can increase the risk of these malignancies, 
apparently by altering the microbiome and 


environmental interactions, actually have a 
low risk of cancer. But oncologists will still 
recommend that they undergo prophylactic 
surgery. “Any time you have to involve a 
large number of people to prevent one 
case of disease,” Levine says, “there are 
unintended consequences.” 

Such criticism does not faze King. 
Variants of unknown significance simply 
should not be included in the test results, 
she says. As for expense, the cost of these 
tests is falling rapidly. By her estimation, 
it might take 300 tests to save one life. 
And at a cost of US$200 per test, she feels 
that the investment is worth the payoff. 
She is emphatic that younger women in 
particular need to get tested, even if they 
do not have breast or ovarian cancer in 
the family. “Women need to know,” she 
says. “Their genes aren’t going to go 
away.” M.V.-M. 


lowering the immune system’s ability to halt 
inflammation. 

In the future, Erdman says, therapeutically 
targeting the microbiome could help to lower 
cancer risk. “There are lots of things you can’t 
change about your genetics,” she says. “But 
there's lots you can change about your interac- 
tion with microbes.” 

Looking forward, scientists are trying to 
understand how cancers linked with BRCA1 
and BRAC2 mutations interact with non- 
hereditary factors (as well as with other genes). 
Perhaps a better understanding of this inter- 
play will allow oncologists to provide more 
precise, individualized risk assessments that 
reduce the number of unnecessary surgeries. 
And maybe one day carriers will know how to 
reliably, and meaningfully, alter their risk of 
developing breast cancer with interventions 
other than surgery. 

For now, King says that the options for 
women carrying a BRCA mutation remain 
limited. They can either live with increased risk 
and fear, or they can get a preventive operation. 
“I wish there were interventions that were safe 
that we could use,” she says. “We don't pres- 
ently have a non-surgical intervention.” = 


Moises Velasquez-Manoff is a freelance 
science writer in Berkeley, California. 
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Mammography is used to examine the soft tissues of the breast in order to screen women for cancer. 


Don’t look now 


Mammogram screenings are an established part of women’s 
health care, but are they more trouble than they are worth? 


BY EMILY SOHN 


ack in 1980, Canadian researchers 
Bee to tackle a pressing question: 

how many women could be saved with 
breast-cancer screening? At the time, many 
researchers suspected that the relatively new 
technology could prevent breast-cancer deaths 
by detecting tumours before they had a chance 
to grow. The truth proved to be complicated — 
so complicated that researchers are still trying 
to understand it 35 years later. 


The Canadian trial enrolled nearly 90,000 
women from across Canada and randomly 
assigned them to receive either mammograms 
and physical breast examination or just breast 
examinations’. By 1990, there were enough data 
to begin analysis. But the researchers were con- 
fused by what they were seeing: the women who 
had received mammograms were not living any 
longer than those who had not. The team was 
so perplexed that it decided to delay publication, 
says Cornelia Baines, a physician epidemiolo- 
gist at the University of Toronto in Ontario. “We 
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thought, ‘if we wait another two years, maybe a 
benefit will begin to emerge,” she says. “After 
25 years, it still hasn't emerged” 

Health campaigns often give women a simple 
message: mammograms save lives. But the data 
behind that message is so murky that experts 
continue to disagree about who really needs 
to be screened. Various organizations offer 
clashing recommendations that range from 
the American College of Obstetricians and 
Gynecologists suggesting that regular mam- 
mograms should start at age 40 to the Swiss 
Medical Board proposing the elimination of 
routine screening altogether. At the same time, 
scientists continue to sort through the data from 
multiple trials, enlist the power of computer 
models, conduct epidemiological analyses, and 
take a closer look at the physical and emotional 
tolls of cancer treatments to address a vexing 
problem: are mammograms helping women, or 
hurting them? 


INVADER DETECTION 

Decades after it started, the Canadian trial illus- 
trates the essence of the mammography conflict 
— overall, women who were tested did not live 
longer, even though screening did uncover 
many cases of invasive cancer. Those diagnoses 
set off rounds of chemotherapy, radiation and 
surgery. But some of the women were already 
too sick to save, and others had cancers that 
could have been ignored: the authors estimated. 
that 22% of the invasive cancers detected by 
mammograms were unlikely to affect a woman's 
health, a phenomenon known as overdiagnosis. 

The Canadian trial is not the final word, 
however, and other studies have suggested that 
widespread screening with mammograms may 
reduce the overall death rate from breast cancer, 
although by how much is another question that 
scientists continue to debate. One review of the 
literature concluded that mammograms can 
reduce deaths by 20% (ref. 2), whereas another 
that looked at 7 randomized trials put the num- 
ber at 10-15% (ref. 3). 

After systematically reviewing studies that 
compared similar groups of women who 
received different rates of screening, a team led 
by Russell Harris, an epidemiologist and cancer- 
prevention researcher at the University of North 
Carolina School of Medicine in Chapel Hill, and 
his colleagues estimated that regular screening 
for women between the ages of 50 and 69 could 
reduce the death rate by about 10% (ref. 4). The 
benefits are smaller for women in their 40s, Har- 
ris says, because the cancer is less common. 

For many women, the chance to reduce their 
risk of dying from breast cancer, even bya small 
amount, with a fairly simple procedure makes 
mammography seem worthwhile. But there are 
reasons to be cautious, says H. Gilbert Welch, 
an internist at the Geisel School of Medicine at 
Dartmouth in Hanover, New Hampshire. What 
a 20% benefit actually adds up to is a reduc- 
tion from 5 deaths out of 1,000 women over a 
10-year period to 4 deaths. 
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Proving that mammograms, along with 
other screening tests, extend lives has been 
notoriously difficult, Welch adds. One reason 
is that randomized trials — the gold-standard 
for assessing any kind of medical procedure 
— need to follow tens of thousands of people 
for many years, often requiring “heroic efforts” 
to keep tabs on mostly healthy people, he says. 
Among other challenges, it is hard to standard- 
ize the procedure. Results vary depending on 
who is reading the images, how hard they look 
for tumours and what kind of abnormalities get 
flagged for a call-back. 

As trials have marched on over the decades, 
the landscape of breast cancer has shifted. A 
rising awareness makes women much more 
likely to detect lumps and get them checked 
out without the need for mammograms. 
Meanwhile, new, targeted treatments allow 
oncologists to extend the lives of women with 
advanced cancers who would have died owing 
to lack of options when trials began. “As we 
become better able to treat advanced stages 
of breast cancer,’ Welch says, “it becomes less 
important to find early forms of it.” 


TOO MUCH KNOWLEDGE 

Determining the value of mammograms must 
take into account the problem of overdiag- 
nosis. Peter Gotzsche, director of the Nordic 
Cochrane Centre in Copenhagen, and author 
of Mammography Screening: Truth, Lies and 
Controversy (CDC, 2012), led a study that sug- 
gested that one-third of Danish women are 
overdiagnosed’. In other countries, including 
the United Kingdom, Australia and Sweden, 
the overdiagnosis rate as a result of introduc- 
ing screening programmes is suggested to be in 
excess of 50% (ref. 3). 

But not everyone agrees that too many 
women are being screened. Etta Pisano, a 
radiologist at the Medical University of South 
Carolina in Charleston, argues that the real 
problem is over-treatment that occurs after the 
mammogram. She compares mammograms to 
metal detectors at airports. Security personnel 
want to take a close look at anything that looks 
remotely suspicious because they “want to find 
all the guns,” she says. But when the ‘guns’ are 
tumours, which can behave unpredictably, the 
path forward remains a judgement call. 

One major problem, Welch suspects, is 
that technological improvements (see ‘Fewer 
places to hide’) have given screening tests the 
power to detect more small and slow-growing 
cancers — twice as many in 2008 as in 1976, 
according to one study*. But the tests are still 
not finding more of the early stages of aggres- 
sive, fast-growing and lethal tumours that 
often appear between mammograms. If rou- 
tine screening tests were catching the worst 
cancers early, Welch says, the data should show 
progressively fewer late-stage diagnoses as the 
number of early-stage diagnoses goes up, along 
with a reduction in death rates. But when he 
and his colleagues looked at data on 16-million 
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NEW TECHNOLOGY 


Fewer places to hide 


Mammograms are not the only way to 
screen for breast cancer. Ultrasound and 
magnetic resonance imaging have been 
available for years, particularly for women 
at high-risk of the disease. And a wide range 
of new technologies is making ever-smaller 
tumours increasingly easy to spot. 
Tomosynthesis (pictured) takes X-ray 
slices of the breast to create a 3D image. 
According to a 2014 analysis? of more than 
450,000 breast examinations from 13 
centres around the United States, combining 
tomosynthesis with digital mammography 
(which uses computers instead of X-ray 
film) found more cancers with fewer false 


American women living in nearly 550 counties 
that varied in mammography rates from less 
than 40% to nearly 80% they found that neither 
is happening’. Mammograms may do a good 
job at finding cancers, he says, but they do not 
seem to save many lives. 

Mammography also carries risks. Besides the 
low-dose radiation from the X-rays, women 
with abnormal results often go on to be scanned 
with even more powerful imaging machines, 
and sometimes this is followed by biopsies, 
chemotherapy, surgery and other painful or 
risky procedures that can cause infections or 
worse. Radiation therapy as a treatment for 
breast cancer increases deaths from heart dis- 
ease by more than 25% and from lung cancer by 
nearly 80% — a big risk fora woman who may 
not need to take it’. The consequences are not 
just physical. Harris notes that being labelled as 
a ‘cancer patient’ can turn lives upside down. 
And although the emotional fallout has been 
poorly studied, interviews with women suggest 
that worrisome mammogram results are com- 
mon sources of anxiety, intrusive thoughts, 
insomnia and other kinds of distress that can 
endure for a lifetime, he says. 

There are also financial concerns. In one 
study, researchers totalled the money spent 
on breast cancer in the United States between 
2011 and 2013. Using data collected by a 
health insurer on more than 700,000 women’, 
the authors found that US$4 billion was spent 
each year on care for breast cancer as a result 
of false positives and overdiagnosis, including 


positives than did mammography alone. But 
tomosynthesis delivers nearly twice as much 
radiation, says Etta Pisano, a radiologist 
at the University of South Carolina in 
Charleston. And comparative data are 
lacking. Pisano is planning a trial that would 
randomly assign 70,000 women to have 
either tomosynthesis or mammography 
to obtain the first accurate assessment of 
which technique helps women the most. 
Further down the line, technologies in 
development include breast computed 
tomography scanners that are similar to 
tomosynthesis, but capture a more complete 
3D image without as much radiation; phase- 
contrast imaging, which would show more 
detail of the interface between tumours 
and the surrounding tissue; and a bra that 
uses thermodynamic sensors to look for 
temperature fluctuations that might indicate 
signs of cancer. For now, the US Preventive 
Services Task Force has determined 
that more research is needed before it 
can recommend any of the emerging 
technologies. E. S. 


extra X-rays and biopsies. “It is a lot of money 
and it gives you a sense that this is not a small 
problem,” says co-author Kenneth Mandl, a 
biomedical informatics specialist at Harvard 
Medical School in Boston, Massachusetts. 

Meanwhile, a strong narrative of breast 
cancer survivorship endures, partly because 
it is impossible to know after treatment ends 
whether it was needed in the first place. “That 
is the screening paradox,” says Gotzsche. “The 
more healthy women you give an overdiagnosis 
to, the more they will tend to be happy because 
they think, ‘Screening saved my life?” 

It is an enduring irony that plagues the daily 
decisions women make about their health. “I 
believe mammography helps a few women,” 
says Welch. “Butit’s a very few and it comes with 
tremendous human costs.” = 


Emily Sohn is a freelance journalist living in 
Minneapolis, Minnesota. 
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BREAST CANCER 


4 BIG QUESTIONS 


Which cancers need 
to be treated? 


Who should be having : 


mammograms, and 
how often? 


What are the risk 
factors for the 
disease? 


Treatment for breast cancer can 
be disfiguring, expensive and 
painful. Identifying the tumours 


*, that actually pose a threat 


can focus therapy where it is 
needed, while saving millions of 
women from potentially harmful 
interventions. 


Women face a cacophony of 
conflicting recommendations 
regarding screening. Should they 
be screened every year starting 


* atage 40, every 2 years at age 


50 or never? Although over- 


screening can create false alarms, 


under-screening could lead to 
preventable deaths. 


Harnessing the immune system 
to fight breast cancer could 

be a winning strategy, a major 
disappointment or both. For 


“certain women, immunotherapy 


could mean not just a treatment, 
but also a cure. 


Knowing which women are 
susceptible to breast cancer 
— and why — could illuminate 


_ the root causes of the disease 


and lead to new approaches for 
prevention and treatment. 


Although treatments 
for breast cancer 
have come along 
way over the past 
few generations, 
researchers are still 


puzzling over some 
tough questions. 


BY CHRIS WOOLSTON 


Roughly 15-30% of tumours 
detected by screening are 
unlikely to cause trouble if left 


. alone. Ductal carcinoma in situ, 


—acommon cancer of the milk 
ducts — rarely turns invasive, but 
women with the disease often 


undergo surgery (see page S114). | 


Mammograms prevent about 

4 cancer deaths for every 1,000 
women screened regularly for 
20 years. The benefits seem to 


* be especially small for women in 


their 40s (see page $118). 


With more than 40 clinical 
trials underway, data continues 
to roll in (see page S105). So 
far, immunotherapy, including 


* anti-cancer vaccines, alone or in : 
combination with other therapies, 


seems to be especially promising 
for cases of ‘triple-negative’ 
breast cancer. 


Some women are more likely to 
develop breast cancer because 
they carry a mutation ina BRCA 


'. gene (see $116), but even the 


risk for these women is shaped 
by exposure to hormones, which 
in turn are influenced by exercise, 
weight control and pregnancy. 


Researchers are sorting 
through genetic and molecular 
biomarkers that could help 


, to forecast the course of 


an individual tumour. Such 
predictions may never be 100% 
accurate, but a little more clarity 
could go a long way. 


Radiologists are working to 
improve the accuracy and 
clarity of breast imaging, and 
public health experts are 


: developing evidence-based 


guidelines that can save lives 
while minimizing harm. 


Each case of cancer is different 
and so is each immune system 
Matching the right treatment with 
the right patient could prove to 
be the defining challenge of the 
future. 


Researchers are looking beyond 
BRCA for other gene mutations 
that could help to set breast 


- cancer in motion. The genetic 


links probably will not be as 
strong, but finding them could 
still prove lifesaving. 


What are the : i i : i ini : is di , 
possibilities, i tri . ; 
and limits, of ; inni j i 7 : : ‘ 
immunotherapy? di i : : fl i i : i i 


Chris Woolston is a science writer based in Billings, Montana. 
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Doubtful health benefit of 
screening from 40 years of age 


Philippe Autier 


Refers to Moss, S. M. et al. Effect of mammographic screening from age 40 years on breast cancer mortality in the UK Age trial at 


17 years’ follow-up: a randomised controlled trial. Lancet Oncol. http: 


dx.doi.org/10.1016/S1470-2045(15)00128-X 


Results of the UK Age trial suggest a significant benefit of annual 
mammography initiated at 39-41 years of age in preventing breast- 
cancer deaths occurring before the age of 50 years; however, this 
approach had no effect on the risk of breast-cancer death occurring 
before the age of 60 years and leads to prolonged deteriorations 

in quality of life owing to overdiagnosis. 


Organized programmes of mammographic 
screening have focused predominately on 
women aged =50 years; whether screening 
would be of benefit to younger women, such 
as those aged 40-50 years, remains unclear. 
Moss et al.! have now published long-term 
results of the UK Age trial, which is the only 
study that has specifically addressed the 
population-wide efficacy of mammography 
screening starting at the age of 40 years. 
Moss et al.'* randomly assigned women 
aged 39-41 years into either an intervention 
group comprising 53,883 women who were 
invited to undergo annual mammography 
screening, or a control group of 106,953 
women who did not undergo screening from 
this age; 68% of women invited for annual 
mammography regularly participated, 
and these women received a maximum of 
seven rounds of screening up to the age 
of 48 years.'” At 50-52 years of age, all of the 
women were invited to participate in the UK 
NHS breast-screening programme (mammo- 
graphy every 3 years). Causes of death were 
recorded up to 20 years after randomization 
(median follow-up 17.7 years). Moss et al. 
concluded that screening of women aged 
40-49 years is associated with an early reduc- 
tion of breast-cancer mortality based on a 
statistically significant decrease in the relative 
risk of this outcome during the first 10 years 
after initiation of annual screening (RR 0.75, 
95% CI 0.58-0.97; Table 1). However, this 
risk reduction faded away beyond the 10-year 
time point; at year 20, the risk of breast-cancer 
death in both groups was not significantly dif- 
ferent (RR 0.93, 95% CI 0.80-1.09). What is 


the public-health relevance of annual mam- 
mography screening from 40 years of age if 
starting less-frequent screening at an age of 
50 years is associated with the same long-term 
risk of death from breast cancer? This ques- 
tion demands further consideration of the 
benefits, harms, and costs of earlier screening. 

Of note, whether the transient risk reduc- 
tion observed can be attributed to screening 
is questionable because of the ‘left-to-nature’ 
design of the UK Age trial.! According to this 
design, only women invited to screening as 
part of the intervention group knew they 
were enrolled in the clinical trial. Women 
allocated to the control group were never con- 
tacted and, therefore, were unaware that they 
were participating in a trial. Furthermore, 
the health professionals involved in the trial 
knew or could identify the women who were 
invited to screening, but were unaware of 
the women comprising the control group. 
These imbalances might have increased 
disease awareness and the quality of medical 
management of women in the intervention 
group, thus introducing performance biases. 
Moreover, the knowledge among the health 
professionals, who were ultimately respon- 
sible for completing the death certificates, 
that certain women attended mammography 
screening might have led to under-reporting 
of breast cancer as the underlying cause of 
death, as suggested in other studies.*# 

The possibility that the left-to-nature 
study design led to an overestimation of risk 
reductions is supported by two facts. First, 
the significant reduction in breast-cancer 
mortality was observed in the first 10 years 


when the left-to-nature design was enforced, 
but not thereafter (Table 1)—when invitations 
to screening were the same for all women. 
Most deaths attributable to breast cancer 
occur several years after diagnosis; therefore, 
deaths prevented as a result of earlier detec- 
tion of breast cancers in women aged up to 
48 years in the intervention arm would have 
been expected to translate into a persistently 
reduced rate of breast-cancer mortality 
beyond the last round of annual screening, 
compared with that of the control arm. Thus, 
the similarity in breast-cancer mortality 
between these cohorts during the last 10 years 
of the trial, when considering only the women 
with cancers diagnosed in the first 10 years 
(Table 1), indicates that earlier detection of 
cancers as a result of screening during the 
intervention period might have been quite 
limited. Second, using the Nottingham 
Profile Index that combines tumour size and 
grade, and the lymph-node status, Moss et al.° 
previously predicted a relative risk of breast- 
cancer death over a 10-year period of 0.90 
(95% CI 0.80-1.01) for the women diagnosed 
with breast cancer during the first 8 years of 
the trial. This predicted relative risk of breast- 
cancer mortality is much lower than the rela- 
tive risk of 0.75 reported in the most-recent 
publication of the trial,’ which suggests that 
the mortality reductions observed in the first 
10 years of the trial might be attributable to 
factors other than screening. 

Analysis of the Nelson—Aalen curves of 
the cumulative breast-cancer mortality for 
all women with breast cancers diagnosed 
during the follow-up period of up to 20 years 
reveals that in the last 10 years, the difference 
in cumulative mortality between the inter- 
vention and the control groups progressively 
shrank'—at year 20, no significant difference 
was seen.' Interestingly, a relatively constant 
separation of the mortality curves between 
year 10 and year 20 was observed when 
only the women with cancers diagnosed 
during the intervention period were con- 
sidered.' These intriguing trends imply that 
the mortality associated with breast cancer 
diagnosed in the last 10 years of the trial 
was greater in the intervention group than 
in the control group. To verify this finding, 
I derived the death rates attributed to breast 
cancers diagnosed during years 10-20 by 
subtracting the rates in each of the mortality 
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Table 1 | Rates and relative risks of breast-cancer mortality in the UK Age trial* 


Breast-cancer Number of breast-cancer 


Women-years 


Breast-cancer-death rate 


Relative risk of Rate difference per 


deaths deaths per 1,000 women-years breast cancer 1,000 women- 

gonsideted Intervention Control Intervention Control Intervention Control deaths Cl) Te hGe Nalin a 
group group group group group group 

In women diagnosed <10 years after randomization 

Within 10 years 83 219 532,747 O53 9322) 0.156 0.207 0.75 (0.58-0.97 =0:051 

Beyond 10 years 99 193 408,221 810,295 0.243 0.238 1.02 (0.80-1.30) 0.005 

Up to year 20 182 412 940,969 1,868,717 0.193 0.220 0.88 (0.74-1.04) -0.027 

In women diagnosed >10 years after randomization* 

Beyond 10 years 60 103 408,221 810,395 0.147 OM27, 1.16 (0.84-1.59 0.020 

In all women diagnosed up to year 20 after randomization 

Up to year 20 242 feyils} 940,969 1,868,717 0.257 0.276 0.93 (0.80-1.09 -0.018 


*All data come from the publication by Moss et al.* *Figures calculated based on data from Moss et al.* 


curves; this calculation produced a relative 
risk of death of 1.16 that was not signifi- 
cant owing to the small numbers of deaths 
(Table 1). Nevertheless, this result indicates 
that a persistent excess of deaths in the inter- 
vention group during this period contributed 
to equalizing death rates in the two groups 
at year 20. These observations echo those 
of previous studies, which concluded that 
annual mammographic screening of women 
in their forties would result in an increase in 
breast-cancer mortality.°* In particular, peri- 
odic exposure of the breasts to X-rays starting 
at age 39-41 years in the UK Age trial might 
have increased the risk of breast-cancer death 
at the later time points. 

Overdiagnosis (and thus overtreatment) 
is another concern associated with use 
of screening programmes. In the context of 
screening, overdiagnosis refers to the detec- 
tion of in situ or invasive cancers unlikely to 
become clinically evident during the patient's 
lifetime.’ Moss et al.' computed rates of over- 
diagnosis in the UK Age trial by compar- 
ing the incidence of such breast neoplasms 
in the two cohorts, with the investigators 
suggesting that use of earlier screening was 
associated with “at worst a small amount 
of overdiagnosis”! However, the frequency of 
overdiagnosis in women invited to screening 
at some point in their life compared with 
the rate in women never invited to screen- 
ing remains the more-important question. 
To examine this question, I extracted data 
from the article on the cumulative incidence 
rates of in situ and invasive breast cancers 
and regressed the rates observed during the 
first 8 years in the control group—when no 
screening was performed in this cohort. 
In the absence of mammography screening, 
the cumulative incidence of breast cancer has 
been shown to increase linearly with age;’° 


therefore, using the linear-regression slope 
obtained for the data from the control cohort 
up to year 8, I computed a 20-year cumula- 
tive incidence of breast cancer of 27 per 1,000 
women. The quasi-identical cumulative inci- 
dence at 20 years reported for both groups in 
the trial was around 42 per 1,000 women.! 
This difference of approximately 15 cases 
per 1,000 women suggests a 35% (15/42) fre- 
quency of overdiagnosis of in situ and invasive 
breast cancers in both groups. Interestingly, 
the data on the cumulative incidence of these 
cancers presented by Moss et al.' also shows 
that the level of overdiagnosis is independent 
of the age at which screening starts, indicating 
that the reservoir of nonprogressing cancer- 
ous lesions that are detectable by mammogra- 
phy is present for long periods of time. Thus, 
if the rates of overdiagnosis are similar when 
screening is started at the ages of 39-41 years 
or 50-52 years, women who participate in 
screening at an earlier age risk longer periods 
of deteriorated quality of life (relating to 
mutilation, including mastectomy, and other 
adverse effects of treatments, as well as associ- 
ated detriments in self-image, sexual relation- 
ships, psychological distress, and so on) 
owing to overdiagnosis and overtreatment. 

The UK Age trial provides evidence that 
starting mammography screening at the age 
of 40 years would provide or no additional 
health benefit over screening initiated at an 
age of 50 years, whereas the harmful conse- 
quences of such a programme would persist 
for longer periods. Methods of screening for 
breast cancer other than mammography need 
to be developed and evaluated. 
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Precision medicine for metastatic breast 
cancer—limitations and solutions 


Monica Arnedos, Cecile Vicier, Sherene Loi, Celine Lefebvre, Stefan Michiels, Herve Bonnefoi 
and Fabrice Andre 


Abstract | The development of precision medicine for the management of metastatic breast cancer is an 
appealing concept; however, major scientific and logistical challenges hinder its implementation in the clinic. 
The identification of driver mutational events remains the biggest challenge, because, with the few exceptions 
of ER, HER2, PIK3CA and AKT1, no validated oncogenic drivers of breast cancer exist. The development of 
bioinformatic tools to help identify driver mutations, together with assessment of pathway activation and 
dependency should help resolve this issue in the future. The occurrence of secondary resistance, such as 
ESR1i mutations, following endocrine therapy poses a further challenge. Ultra-deep sequencing and monitoring 
of circulating tumour DNA (ctDNA) could permit early detection of the genetic events underlying resistance 

and inform on combination therapy approaches. Beside these scientific challenges, logistical and operational 
issues are a major limitation to the development of precision medicine. For example, the low incidence of most 
candidate genomic alterations hinders randomized trials, as the number of patients to be screened would 

be too high. We discuss these limitations and the solutions, which include scaling-up the number of patients 


screened for identifying a genomic alteration, the clustering of genomic alterations into pathways, and the 


development of personalized medicine trials. 


Arnedos, M. et al. Nat. Rev. Clin. Oncol. advance online publication 21 July 2015; doi:10.1038/nrclinonc.2015.123 


Introduction 

The discovery of the oestrogen receptor (ER)' and 
human epidermal growth factor receptor-2 (HER2)? 
as therapeutic targets in patients with breast cancer 
has enabled treatment success in terms of patient out- 
comes with ER or HER2-blocking therapies,** and set 
the stage for the development of stratified medicine. 
Furthermore, progress in cancer genomics research 
over the past few decades has reinforced the notion 
that cancer is driven by various genomics alterations.° 
As a result of different international initiatives such as 
The Cancer Genome Atlas (TCGA) or the International 
Cancer Genome Consortium (ICGC), the use of next- 
generation sequencing (NGS) has helped define the 
genomic landscape of early stage breast cancer.® These 
studies have revealed the high level of tumour hetero- 
geneity for each breast tumour that consists of several 
molecular subsets, which are driven by distinct molecu- 
lar alterations, indicating that tumours could be treated 
according to their individual molecular landscape. 
Despite the exciting potential for personalized medicine, 
ER and HER2 are currently the only targetable molecu- 
lar alterations with confirmed predictive and prognostic 
value.** Other targeted therapies, such as mTOR and 
CDK4/6 inhibitors, have been approved on the basis of 
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their efficacy in subgroup populations, but no predic- 
tive biomarkers have been found.”* In this Review, we 
discuss the potential applications of genomics to improve 
the management of metastatic breast cancer (MBC), and 
consider the challenges that precision medicine must 
overcome before it can be widely implemented in the 
clinic—most notably, those challenges that relate to 
the remarkable cellular complexity of this type of cancer. 


Genomic landscape of breast cancer 

Analysis of the molecular features of early stage breast 
cancer using NGS has led to the description of the 
genomic landscape of this disease.° This research has 
confirmed that TP53 and PIK3CA mutations are the 
most frequent genomic alterations overall in all intrinsic 
subtypes (28% for both genes). Amplifications in ERBB2, 
FGFR1 and CCND1 follow in frequency, being observed 
in 10-20% of all breast cancer subtypes. Additional 
alterations are less frequent, but could be highly clinically 
relevant, including PTEN mutations and deletions, and 
AKT1, RB1 BRCA1 or BRCA2 mutations. Sequencing 
analyses have uncovered mutations in other genes of 
interest that might have some clinical relevance in breast 
cancer, including KRAS, APC, NF1, SKT11, MAP2K4, 
MAP3K]1 and AKT2.° A similar genomic study focused 
on patients with triple-negative breast cancer (TNBC) 
revealed a more heterogeneous molecular profile, with 
some tumours having just a few molecular alterations 
whereas others harboured hundreds of alterations.’ 
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Key points 


» Recent research data defining the genomic landscape of breast cancer have 
reinforced the notion that this disease is driven by genomic alterations 
= So far very few gene drivers validated in breast cancer have been identified, 


including ESR1, ERBB2, PIK3CA and ATK1 


» Identification of drivers, characterization of resistant clones, identification of 
DNA repair defects and mechanisms of immune suppression are potential uses 


of genomics to personalize medicine 


= The development of precision medicine for the treatment of breast cancer has 
several major challenges that include low frequency of targetable molecular 
alterations, feasibility of high-throughput technologies and availability of 


targeted therapy 


Table 1 | Targetable genomic alterations in breast cancer 


Gene Alteration Frequency (%) Candidate drug Level of 
evidence for 
the target 

Growth factor receptors 

ERBB2 Amplifications >10 HER2 inhibitor al 

Mutations 3 
EGES| Amplifications 5-10 FGFR inhibitor 4 
FGFR1 Amplifications 5-10 FGFR inhibitor 2 
FGFR2 Amplifications 1-5 FGFR inhibitor 2 
IGF1R Amplifications 1-5 IGFR inhibitor 4 
EGFR Amplifications 1-5 EGFR inhibitor 2 
PI3K/AKT/mTOR 
PIK3CA Amplifications >10 PI3K inhibitor 1-2 
Mutations 

PIK3R1 Mutations 1-5 Not known 

PTEN Mutations 5-10 AKT inhibitor 
Deletions 

AKT1 Amplifications 1-5 AKT inhibitor 2 
Mutations 

AKT2 Amplifications 1-5 AKT inhibitor 

AKT3 Amplifications 1-5 AKT inhibitor 4 

INPP4B Deletions 1-5 AKT inhibitor NA 

MEK pathway 

NF1 Mutations 1-5 MEK inhibitor 26 

KRAS Amplifications 1-5 MEK inhibitor 2C 

BRAF Amplifications 1-5 MEK inhibitor 2C 

JNK pathway 

MAP2K4_ Mutations 5-10 Not known NA 

Deletions 
MAP3Ki Mutations 5-10 Not known NA 
Deletions 
GPS2 Mutations 1-5 Not known NA 
Cell cycle 
CCND1 Amplifications >10 CDK4 inhibitor 4 
CDKN2A _ Deletions 5 Not known NA 
CDKN1B Alterations 1-5 Not known NA 
CDK4 Amplifications 1-5 CDK4 inhibitor 
Rb Mutations 5-10 Resistance to 
Deletions CDK4 inhibitor 

DNA repair 

BRCA1 Mutations 1-5 PARP inhibitor 1 
Deletions 


Interestingly, a comparison of the RNA sequencing with 
the genomes/exomes in this study revealed that only 
36% of validated somatic single nucleotide variations 
were observed in a transcriptome sequence, which poses 
a question regarding the use of NGS alone to identify 
potential drivers of breast cancer. Finally, proteomic 
analyses in relation to the TCGA dataset in early stage 
breast cancer have indicated that three pathways are pre- 
dominantly activated in all subtypes: PI3K/AKT/mTOR, 
p53 and CCND1/CDK4/Rb.° A large proportion of the 
molecular alterations identified can be targeted by new 
therapeutics; a non-exhaustive list of candidate targetable 
genomic aberrations is shown in Table 1. 

Molecular analyses of metastatic breast cancers suggest 
that breast cancer can evolve over time, losing and/or 
gaining new alterations. As an illustration, discordances 
in ER, progesterone receptor (PgR), and HER2 status 
between the primary tumour and the metastasis have 
been reported in around 16%, 40%, and 10% of patients, 
respectively, ° with loss of HER2 expression reported in 
around 24% of patients.'! Several studies have investi- 
gated the genomic landscape of breast cancer from early 
to metastatic disease, identifying ESR1 mutations as one 
of the genomic alterations that mediates resistance to 
aromatase inhibitors (AI).!*!* ESR1 mutations occur in 
10-30% of ER-positive MBC that are resistant to Als, 
and lead to ligand-independent activation of the ER. 
Whether ESRI mutations are present as a minor sub- 
clone when the cancer arises or are acquired during the 
treatment is still unknown; however, the percentage of 
these mutations increases under the selective pressure of 
therapy and are able to drive resistance. So far, few large 
comprehensive analyses have reported the genomic land- 
scape of metastatic breast cancer. One of these studies 
performed whole-exome sequencing (WES) of 93 biopsy 
samples from patients with MBC, and reported, in addi- 
tion to the already known molecular aberrations in the 
early setting, a higher incidence of TSC1/TSC2 muta- 
tions." In the study by Toy et al.,!? TP53, PIK3CA and 
GATA3 were the most frequently identified molecular 
alterations.’? In addition to the previously mentioned 
mutations in ESRI, these investigators have identi- 
fied mutations in RPTOR and ERBB3 at much higher 
rates than those reported in the TGCA. In the latest 
interim analysis of the currently ongoing MOSCATO- 
01 trial,° a molecular screening of 700 patients with 
advanced-stage cancer, including around 70 with meta- 
static breast cancer, alterations of the PTEN/PI3K/AKT 
and FGFR/FGF pathways were the two most frequently 
detected actionable pathways observed across all tumour 
types. The now proven evolution of breast cancer over 
time constitutes the basis for assessing metastases to 
direct therapy in trials of precision medicine. Other 
potential mechanisms of acquired resistance have been 
reported in other disease subtypes, including reversal 
of BRCA1/2 mutations as a mechanism of secondary 
resistance to PARP inhibitors’® or, in the HER2-positive 
segment, the acquisition of ERBB2 mutations.'” These 
mechanisms need to be confirmed, and potential new 
aberrations have yet to be identified. 
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Table 1 (cont.) | Targetable genomic alterations in breast cancer 


Gene Alteration Frequency (%) Candidate drug Level of 
evidence for 
the target 

DNA repair (cont.) 

BRCA2 utations 1-5 PARP inhibitor al 

Deletions 

ATM Mutations 1-5 PARP inhibitor 

ATR utations 1-5 PARP inhibitor 

MDM2 Amplifications 1-5 MDM2 inhibitor 4 

P53 Mutations >10 Not known NA 

ER signalling 

ESR1 Mutations >10% in metastatic Not known 2 

Amplifications ER+ MBC resistant to 
Translocations endocrine therapy 

GATA3 Mutations 5-10 Endocrine therapy 

FoxA1 Mutations 1-5 Endocrine therapy 

Epigenetics 

KMT2C utations 5-10 Drug targeting 4 

epigenetics 

KMT2B Mutations 1-5 Drug targeting 4 

epigenetics 

KDM6A Mutations 1-5 Drug targeting 4 

epigenetics 

SEFD2 Mutations 1-5 Drug targeting 4 

epigenetics 

Others 

NOTCH3 Amplifications 1-5 NOTCH inhibitor 4 


Abbreviations: HDAC, histone deacetylases; NA, not available. 


Tools for precision medicine 

The detection of genomic alterations in patients with 
breast cancer can help to tailor therapy. Over time, 
several molecular tools have been developed to serve 
this type of personalized therapeutic approach in breast 
cancer. Historically, immunohistochemistry was used 
to stratify patients with breast cancer according to 
the presence of certain biomarkers, and is currently 
used to determine the expression of ER and HER2."*° 
Fluorescence in situ hybridization (FISH) analysis was 
then developed as a method to quantify copy number. 
Currently, identification of ERBB2 amplifications is 
the main standard application of this technique; in the 
future, it could also be used for the quantification of 
other amplifications such as FGFRI or CCND1. Several 
multigene assays have also been developed to quantify 
gene-expression using either RT-PCR (Oncotype DX®),” 
DNA array (Mammaprint®),”’ or NanoString technolo- 
gies (Prosigna™).” These assays are currently used in 
early stage breast cancer to stratify patients according 
to their risk of relapse, but these technologies could be 
applied to other applications in the metastatic setting. 
For example, gene-expression arrays can accurately 
quantify gene expression and, therefore, determine 
which pathways might be activated, such as mTOR” or 
MEK,” in order to target the most relevant pathways in a 
particular tumour. More recently, comparative genomic 
hybridization (CGH) arrays and single nucleotide 
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polymorphism (SNP) analyses have been used to identify 
targetable genomic alterations in breast cancer.* These 
technologies quantify DNA copy number in the whole 
genome, and enable the detection of gene loss and gene 
amplification, and their application in clinical practice 
is feasible in around 70% of patients.*° As mentioned 
before, the main genomic alterations in breast cancer 
include FGFR1, CCND1, ERBB2, EGFR, MDM2, AKT3 
amplifications, and PTEN deletions.*? NGS is currently 
used to detect mutations (and in some instances, copy- 
number changes) and has several potential applications 
in the clinic. In most trials NGS is used to detect clonally 
dominant mutations in a multigene panel, although, 
ultradeep sequencing can be used to detect minor sub- 
clonal alterations that would potentially mediate further 
resistance to targeted therapy. Some centres are starting 
to use WES in the clinical setting.” In the context of 
breast cancer, WES could be particularly appealing to 
detect genome scars mediated by DNA repair defects, 
which could be used as a predictive biomarker for the 
use of PARP inhibitors.”* Other applications include the 
detection of driver alterations and neoantigens. New 
tools are also being developed that could be comple- 
mentary to the ones mentioned previously. For example, 
RNA-seq could be used to quantify RNA expression 
and detect structural changes, such as mutations and 
translocations. The detection of ctDNA is also becom- 
ing increasingly popular, and has been shown to be 
feasible in providing a readout of the tumour genomic 
landscape in patients with breast cancer.”*” This techno- 
logy is being developed as a noninvasive alternative to 
biopsies to monitor drug efficacy and to detect genomic 
alterations involved in resistance. Finally, technologies 
centred on nucleic acid detection will not be sufficient 
to obtain a comprehensive molecular profile of tumours, 
as protein expression patterns also need to be consid- 
ered. Several technologies are being developed, and 
include Theranostics Health’s TheraLink assay, which 
measures phosphoprotein levels in tumour samples to 
identify pathways that are activated. One could, there- 
fore, hypothesize that quantifying the levels of HER2, ER, 
mTOR and CDK¢ expression could be useful to identify 
the subset of patients that could benefit from approved 
targeted therapies. 


Genomics and lethal breast cancers 

Within the context of metastatic breast cancer, four 
potential applications of genomics to improve patient 
outcomes are currently available (Figure 1). The first 
one is the identification of oncogenic drivers. A genomic 
driver could be defined as the molecular alteration 
responsible for cancer progression, thus, targeting this 
gene is expected to have a therapeutic effect, namely 
circumvention of oncogene addiction.*' ER and HER2 
are the historical drivers of breast cancer. Other potential 
drivers, which have been identified by high-throughput 
technologies, are potential targets of genomic-driven drug 
development.*'**” Most of these mutated genes, with the 
exception of PIK3CA, are present at low frequencies 
(<10%) and have yet to be validated. On the one hand, 
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Figure 1 | Possible applications of genomics to personalize therapy of MBC. A first 
application is the identification of new gene drivers for MBC, aside from ER, 
ERBB2, PIK3CA and AKT1. Further applications include the characterization of the 
molecular alterations specific for the resistant clones (application II), such as the 
presence of ESR1 mutations in patients previously treated with an aromatase 
inhibitor; the identification of DNA repair defects that might influence treatment 
decision (application III) and the characterization of mechanisms of immune 
suppression developed by the tumour cell (application IV). Abbreviation: MBC, 
metastatic breast cancer. 


several genomic alterations have already been associated 
with an objective response when targeted clinically. These 
include PIK3CA mutations,*? FGFR1 amplification,**** 
AKTI1 mutations,” and EGFR amplifications.” In addi- 
tion, ERBB2 mutations present a very strong preclinical 
rationale.** On the other hand, some genomic alterations 
have been reported as candidate drivers, but have not 
been found to be predictive for the efficacy of targeted 
therapies. This was the case for CCND1 amplifications 
and the CDK4/6 inhibitor palbociclib.*’ This finding 
indicates that some drivers of cancer progression could 
be detected through analysis at the RNA/protein level, 
and not only at the DNA level. In fact, ER expression is 
the most important driving force of cancer progression 
and, notably, no DNA alterations in ESRI are detected in 
most patients.’* The same principle could apply to mTOR 
and CDK¢ inhibition, as these pathways are activated in 
most tumour samples, even in the absence of any pathway 
specific genomic alterations.***” Thus, there is a need to 
develop molecular tools that measure pathway activa- 
tion, rather than gene amplifications or mutations, either 
through RNA or protein expression. 

The second possible application of genomics in 
advanced-stage breast cancer is the identification of 
genomic alterations responsible for secondary resistance. 
As discussed previously, ESR1 mutations are observed 
with frequencies of up to 10-30% in tumours previ- 
ously treated with Als.'*!° Moreover, preclinical studies 
have demonstrated that the presence of these mutations 
renders the cancer cell insensitive to Als, but partly sen- 
sitive to endocrine therapies that target ERs directly, as 
these mutations, which mostly affect the ligand-binding 
domain, induce the production of a constitutively active 
receptor with a conformational structure that limits 


the binding of agents to the receptor.'!? Comparison of 
TCGA and WES results have shown that these molecu- 
lar alterations are not present or are present in very low 
levels in the pretreatment samples, even when detected 
using ultra-deep sequencing.” In April 2015, the 
ER-degrading agent GDC-0810 has demonstrated anti- 
tumour activity in patients with ER-positive tumours, 
even when the tumours harboured an ESRI muta- 
tion.*® Thus, detecting ESR1 mutations, or any other 
secondary mutations, in patients with advanced-stage 
breast cancer could enable treatment with a specific 
targeted therapy. Of note, other molecular alterations 
that could be involved in secondary endocrine resist- 
ance include TSC1/2 mutations that could be targeted 
using an mTOR inhibitor." Similarly, other genomic 
alterations that might be involved in resistance to tar- 
geted therapies, such as PTEN mutations and deletions, 
have been reported to mediate resistance to a-selective 
PI3K inhibitors.“' Secondary mutational events could 
be determined by performing ctDNA analysis and 
ultradeep sequencing; analysis of cDNA might enable 
these genomic alterations to be detected early in disease 
evolution (even before relapse) and ultradeep sequenc- 
ing could enable detection of these mutations as soon as 
the tumour becomes detectable. Nevertheless, it must be 
acknowledged that no evidence exists in breast cancer 
that ultradeep sequencing will lead to the detection 
of minor subclonal alterations, such as the previously 
mentioned ESR1 or PTEN mutations. Whether detect- 
ing these genomic alterations early during the disease 
course will lead to improvement in outcomes remains a 
major clinical question in the field of precision medicine. 

A third application of genomics in lethal breast cancer 
is the identification of DNA-repair defects, mutational 
processes and defects in the DNA duplication mech- 
anism. Indeed, the use of high-throughput WES could 
identify the defective DNA repair pathway in each 
patient. For example, a study using WES to analyse 21 
primary breast tumours and matched normal DNA 
samples from the same individuals identified five 
biologically distinct mutational signatures present 
in these 21 cancers (named A-E), characterized by a 
different expression profile, and each signature con- 
tributing to the cancer development in all 21 patients, 
each to a different extent.” This study also identified 
the apolipoprotein B mRNA-editing enzyme catalytic 
polypeptide (APOBEC) family of cytidine deaminases 
as having a role in generating particular genome-wide 
mutational signatures, including a signature of localized 
hypermutation (kataegis). 

In fact, the APOBEC enzymes, which have evolved 
as key players in natural and adaptive immunity, have 
been proposed to contribute to cancer development and 
clonal evolution of cancer by inducing collateral genomic 
damage owing to their DNA deaminase activity.” Data 
also suggest that high APOBEC activity is a source of 
PIK3Ca mutations in the helical domain, whereas in 
tumours with low APOBEC activity, PIK3Ca is equally 
likely to be mutated at the kinase domain hot spot and 
the helical domain. * 
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Figure 2 | Limitations and solutions for precision medicine in MBC. Several limitations and possible solutions to the 
development of a precision medicine approach for the management of MBC are depicted in this figure. Some limitations 
are related to the tumour specimen and its molecular profiling; while others are related to the identification of driver 
genomic alterations to guide targeted therapy. Even when a potential driver has been identified, several challenges arise 
in selection of the targeted therapy, including availability of optimal therapy and drug development in cohorts defined by 
a genomic alteration. Possible solutions are also reported. Abbreviations: ctDNA , circulating tumour DNA; L, limitation; 


MBC, metastatic breast cancer; S, solution. 


DNA-repair deficiencies can also be detected using 
other technologies. For example, the use of SNP arrays 
could help to detect those tumours with homologous 
recombination deficiencies that might be sensitive to 
PARP inhibition.“ Sequencing of candidate genes could 
help to identify DNA repair defects in the tumour and 
lead to development of individualized treatment strat- 
egies. As an illustration, PARP inhibitors have been 
shown to be effective specifically in patients with 
BRCA1/2 mutations.*° Other genes of interest in this 
category include ATM and ATR.**”’ Finally, detection 
of TOP2A amplifications could be useful to determine 
which patients should receive anthracyclines. 

The fourth potential application of high-throughput 
technologies in the treatment of advanced-stage breast 
cancer is the identification of mechanisms of immune 
escape at the individual level. Anti-PD1 treatments have 
shown antitumour activity as monotherapy in TNBC, 
with an objective response of 18.5%.” Nevertheless, tools 
that might predict efficacy of such drugs are currently 
lacking, and there is a need to find targets to develop 
other immunotherapeutics. Genomics can potentially 
be used to evaluate five different aspects of the immune 
system for the purpose of precision medicine. First, it has 
been suggested from studies of patients with melanoma 
and lung cancer that the detection of neoantigens and 
mutational load could predict efficacy of immune check- 
point modulators.* The applicability of such a concept to 
breast cancer is still unknown. Second, genomics could be 
used to quantify expression of ligands of immune check- 
point proteins. Third, sequencing and gene-expression 
analysis could detect those cancer cells that are resistant to 
cytotoxic T-cell lymphocytes (CTL). For example, some 


cancer cells present either a TAP1 mutation or they do 
not express MHC class I; both of them confer resistance 
to CTL.*! Fourth, genomics could be used to quantify and 
qualify the local immune system. For example, infiltra- 
tion by CD8* T cells could predict efficacy of anti-PD1 
therapy; thus, gene-expression analysis could quantify 
such infiltration and determine the level of Th, immune 
response for a given tumour.” Finally, genomics could 
be used to detect genetic polymorphisms that are associ- 
ated with immune defects. Of note, some breast cancers 
harbour SNPs in TLR4 and P2RX7 that can be associ- 
ated with immune suppression and lack of response 
to immunotherapeutics or chemotherapy.” All these 
approaches could have a potential clinical application, 
and their use in the case of neoantigens would seem 
particularly promising. 


Limitations of precision medicine 

Despite the continuing advances in high-throughput 
technologies and their multiple applications in cancer, 
there are several challenges that need to be resolved 
before they can be applied for personalized medicine in 
metastatic breast cancer (Figure 2). The most important 
limitations and some potential solutions are described 
below. These challenges can be divided into ‘scientific 
challenges versus ‘logistical and operational’ challenges 
(Table 2). 


Driver identification 

Most of the genomic-driven trials failed to report high 
levels of antitumour activity of therapy in patients with 
metastatic breast cancers.****°° The biggest challenge 
to develop precision medicine in breast cancer will be to 
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Table 2 | Limitations of precision medicine for breast cancer, and possible solutions 


Limitations 


Evidence 


Possible solutions 


Logistical and operational challenges 


Completion of trials testing 
drugs in genomic segments 


is challenging 


Genomic results cannot be 
provided for a substantial 
number of patients 


Number of patients with an 


identified oncogenic driver 
is low 


Drug access is limited 


Genomic alterations are rare, yet 
randomized clinical trials are still required 
for approval 


Biopsy is not feasible in all patients, low 
percentage of cancer cells in some patients 


Number of genes screened is too low 
Known DNA alterations do not explain cancer 
progression in a large number of patients 


Drug development is limited to small 
number of locations and patients 


Scale-up molecular screening and cluster genomic 
alterations into pathways 

Move to personalized medicine 

Develop fast-track approval based on comparison 
with historical controls 


Collection of ctDNA and analysis using in-depth 
NGS 


Use multiplex assays 
Use RNA and protein-based assays to identify 
drivers 


Develop programs for drug access 
Limit the panel genes to those for which drugs 


Genomic tests are too 
expensive and therefore not 
affordable by the majority 


private companies 


Scientific challenges 


Response rates are low 
Multiple pathways activated 
Intratumour heterogeneity 


Secondary resistance occurs 
treatment pressure 


Tumour adaptation using protein networks 


Tumour evolution 


Genomic tests for clinical use are run by 


Failure to identify oncogenic drivers 


Additional genomic alterations under 


are available 


Run genomic tests in academic centres to allow 
access to innovation for all patients 


Improve tools to identify drivers 

Develop protein and RNA based predictors 

Develop treatment or drug combinations 

Exclude patients with high intratumour heterogeneity? 


ctDNA and ultradeep sequencing to detect 
resistance early on and adapt therapy 

Identify feedback loops and combine therapies 
Combine with drugs that targets host 
(immunotherapeutics) and driving forces of tumour 
evolution (DNA repair deficiency) 


Abbreviations: ctDNA, circulating tumour DNA; NGS, next-generation sequencing. 


increase the magnitude of the antitumour effects associ- 
ated with such an approach; therefore, it is paramount to 
identify and target the actionable genomic driver events 
and differentiate them from passenger events. This is 
a substantial issue because (with the exception of ER, 
ERBB2 and PIK3CA) none of the genomic alterations 
reported in any of the sequencing projects** have been 
validated as a target in the preclinical or clinical settings. 
Driver alterations are expected to belong to the ‘cancer- 
related’ gene catalogue, proposed by the Broad Institute in 
2013.°° Also, these alterations are expected to be clonally 
dominant and to lead to alterations in protein function 
(including activating mutations, biallelic loss or high level 
amplification in a narrow amplicon). The definition of 
cancer-related genes together with the listed driver char- 
acteristics are an ongoing effort led by bioinformaticians 
to produce an algorithm that will aid in the identifica- 
tion of these bona fide drivers. Most of the bioinformatic 
methods to identify driver genes exploit the recurrence 
of alterations to identify those genes that, when altered, 
might confer a survival advantage to the tumour cells. 
In particular, methods such as Gistic2*” or RAE,** enable 
detection of loci with recurrent focal copy-number 
amplifications or losses, and identification of functional 
copy-number alterations. More recently, with the growing 
number of large-scale sequencing projects and the associ- 
ated catalogues of somatic mutations, computational bio- 
logists have focused on the development of methods for 


the identification of highly mutated genes. These methods 
aim to identify those genes that demonstrate features of 
positive selection by taking into account the mutational 
load of the gene and comparing it to its background 
mutational rate,*°*-*! together with additional features 
such as the gene-expression level and replication rate 
or the localization of the mutations and their predicted 
effects on the activity of the protein.” Importantly, these 
methods will also help categorize the somatic alterations 
as gain or loss of function, hence, determining the role of 
the gene as an oncogene or a tumour suppressor, even- 
tually leading to development of targeted therapies.” 
Finally, while these methods mainly exploit DNA alter- 
ations, it is important to take into account RNA expres- 
sion levels (determined for example by gene-expression 
array) that have been successfully used to identify points 
of addictions of the tumour cells and that should be inte- 
grated with DNA alterations for better predictions of 
targetable pathways in cancer.® 

The ER is the most validated target to date in breast 
cancer, where altered protein expression is the key 
feature, any new target should be defined by protein 
overexpression and/or pathway activation, rather than 
DNA alterations alone. Several arguments support the 
hypothesis that pathway and protein activation could 
be as important as DNA mutations in breast cancer. 
First, a significant proportion (around 50%) of patients 
with breast cancer do not present targetable molecular 
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alterations in cancer-related genes.” Second, activation 
of pathways and proteins assessed by gene expression” 
or protein phosphorylation is predictive of the efficacy 
of targeted therapy. Finally, the presence of a target does 
not necessarily lead to pathway activation. For example, 
patients with ERBB2 amplification present high levels of 
intertumour heterogeneity regarding AKT phosphory- 
lation. Despite this heterogeneity, there is a correlation 
between ERBB2 amplification and AKT phosphory- 
lation, but this is not always present in all cases of HER2- 
positive disease.® Overall, with a few exceptions, it is not 
possible to define a driver at the individual level in breast 
cancer, although some efforts are ongoing to provide bio- 
informatics tools that will overcome this issue. A strong 
rationale exists to support assessing pathway activation, 
rather than only DNA alteration, to identify potential 
targetable drivers in breast cancer. 


Targeting molecular alterations: the right drug 

The failure to deliver personalized medicine is often 
associated with the lack of highly bioactive and specific 
drugs. For example, PIK3CA mutations occur in approxi- 
mately 25% of breast cancers and have been reported as 
drivers of this disease.*” Nevertheless, the use of non- 
selective PI3K inhibitors led to modest response rates 
(4%) in early clinical trials when administered as mono- 
therapy. Second-generation, a-selective, PI3K inhibi- 
tors are more specific and produce better inhibition 
in vivo in animal models.” In fact, preliminary results 
with these new inhibitors indicate partial responses 
of around 6% in patients with PIK3CA mutant breast 
cancers, whereas no responses had been observed so far 
in patients with PIK3CA wild-type tumours.® Thus, it 
is crucial to develop drugs that hit the target with the 
highest specificity and bioactivity. 


Towards combination therapies 

Targeting a mutational driver should lead to an antitu- 
mour response. Nevertheless, intratumour heterogeneity 
could explain some primary resistance to therapy. The 
presence of two or more alterations in cancer-related 
genes has been associated with resistance to targeted 
therapies in vitro and in clinical trials.® Interestingly, 
Stephens et al.° reported that 67% of the breast cancer 
samples analysed had two or more genomic alterations. 
There are several illustrations of such an issue in breast 
cancer. For example, the coexistence of PIK3CA muta- 
tions and ERBB2 amplification has been associated with 
resistance to the HER2-targeting drugs trastuzumab 
and lapatinib,” providing the rationale to combine 
PI3K and HER2 inhibitors in different clinical trials. 
Interestingly, this resistance to anti- HER2 therapy 
owing to the presence of PIK3CA mutations have not 
been observed when dual HER2 blockade was achieved 
through the use of two monoclonal antibodies,” or in 
trials performed in the early setting, where the presence 
of PIK3CA mutations is already evident.” Moreover, in 
the phase III BOLERO2 trial, which evaluated the admin- 
istration of the mTOR inhibitor everolimus in combina- 
tion with the aromatase inhibitor exemestane in patients 
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with advanced-stage ER-positive/ HER2-negative breast 
cancer, the presence of multiple alterations in different 
pathways (that is, FGFR1/2, CCDN1, PTEN and PIK3CA) 
translated into resistance to mTOR inhibition.’ 

The identification of a driver is crucial for the success 
of targeted therapies, although, it is worth noting that 
secondary resistance will occur in the vast majority of 
patients presenting with advanced disease. This resist- 
ance is a result of tumour adaptation at the DNA or 
protein level under the selective pressure of therapy.”>”* 
The acquisition of ESR1 mutations during endocrine 
therapy is currently the best model of genomic evolu- 
tion in response to targeted therapy.'*'? Development 
of secondary resistance with the emergence of resistant 
clones could be a result of the high level of intratumour 
heterogeneity. Several reports emphasize the impor- 
tance of intratumour heterogeneity and clonal evolution 
in breast cancer patients. Cancers evolve by a process 
of clonal expansion, genetic diversification and clonal 
selection with complex dynamics and with highly vari- 
able patterns of genetic diversity and resultant clonal 
architecture. Therapeutic intervention might decimate 
cancer clones, but might also promote the expansion 
of resistant variants.’>”° From a clinical standpoint, 
early detection of resistance is crucial to optimizing 
therapy, and the use of ultradeep sequencing in mul- 
tiple regions of a biopsy, together with monitoring of 
tumour evolution using ctDNA,” could provide this 
remit.”” Moreover, some studies have reported hetero- 
geneity in terms of different metastatic sites exhibiting 
different genomic alterations,” and how to handle this 
level of heterogeneity is still unclear. One could argue 
that trials investigating precision medicine should assess 
genomic alterations only in metastatic sites presenting 
evidence of progressive disease, but not in those sites 
that are under control. This approach is being evalu- 
ated in an ongoing clinical trial (MATCH-R), which will 
prospectively investigate the evolution of clonal archi- 
tecture in tumours from patients that experience disease 
progression after an initial benefit from a targeted agent. 
These patients would undergo a biopsy of the progress- 
ing lesions in order to identify the mechanisms involved 
in resistance, and mouse avatars will be generated in 
some cases to identify the best ad-hoc treatment. At the 
very least, the presence of multiple drivers in different 
progressing metastatic sites should lead to the use of 
combination therapy. 

A second mechanism of tumour adaptation consists 
of the activation of alternative protein networks that will 
bypass target inhibition. This mechanism of resistance 
has been documented in patients treated with mTOR 
inhibitors, whereas a feedback loop lead by mTORC2, 
has been observed that results in AKT activation owing 
to growth factor receptor phosphorylation (including 
IGF-1R),” providing the rationale to combine these 
agents with either IGF-1R® or PI3K*! inhibitors. 

Ultimately, one could argue that secondary resistance 
can only be avoided by developing therapies that could 
lead to cancer cell eradication. As an illustration, the 
combination of targeted therapies and the stimulation of 
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Figure 3 | Development of MBC precision medicine. There are two main approaches to the implementation of personalized 
medicine in the management of advanced-stage breast cancer. a | A stratified medicine approach: a specific molecular 
alteration is determined in a tumour sample (pre-screening). Only those tumours harbouring the genomic aberration are 
treated with the drug targeting that specific alteration. In the case of negativity for this molecular aberration, potentially 
other alterations might be subsequently determined to inform selection of other targeted therapies. These targets might 
also be identified within the context of molecular screening programmes that use high-throughput technologies to identify 
several molecular alterations in parallel. The main limitation of this approach is the low frequency at which the majority of 
these genomic aberrations are present. b | A personalized medicine approach: the goal is to develop trials showing that a 
drug is effective in a cohort of patients defined by a genomic alteration, because this approach assesses if treatment 
individualization based on high-throughput genomics improves outcome in all comers as compared with standard therapy. 
The main challenge for this approach is determining the standard arm (such as targeted therapy but unselected or 
standard of care). Abbreviations: inh, inhibitors; MBC, metastatic breast cancer. 


immune checkpoints could enable eradication of tumour 
cells.” This concept is current being evaluated in a trial 
testing the combination of trastuzumab and pembro- 
lizumab, an anti-PD1 agent, in HER2-overexpressing 
breast cancers. 


Logistical and operational challenges 

Precision medicine includes two different approaches, 
namely, stratified medicine and personalized medi- 
cine (Figure 3). Stratified medicine (Figure 3a) consists 
of testing a drug in a population defined by a specific 
molecular alteration, whereas testing of personalized 
medicine (Figure 3b) investigates whether the concept 
of treatment individualization improves outcomes 
in ‘all comers. The screening of patients for inclusion in 
clinical trials is the major limitation for the develop- 
ment of stratified medicine in MBC, because most of 
the current candidate drivers are detectable in less than 
10% of the patients. If we consider the development of 
FGFR1 inhibitors for example, and assume the same sta- 
tistical analyses as in the BOLERO-2 trial,® in which a 
total of 528 progression-free survival (PFS) events were 
required in order to generate a hazard ratio of 0.74 with 
90% power, and assuming a median PFS of 3.7 months 
in the control group, and a 2:1 randomization ratio in 
favour of the investigational arm, a total 705 patients 
would need to be randomized. If we further assume that 
FGFR1-amplification is observed in around 10% patients 
and a screening failure rate of around 15%, then a total 
of 8,294 patients will be required to be screened to make 
this study feasible. 


This approach generates two major issues: first, when 
a patient is tested for a single gene alteration, its likeli- 
hood of being positive and, therefore, treated by a drug 
matched to a genomic alteration is very low. One solution 
to address this issue is to use high-throughput genomic 
approaches to detect all present genomic alterations and 
enable the patient to be assigned to a specific therapeutic 
trial. The second issue is the need to screen a large number 
of patients to perform a clinical trial. If we take the sce- 
nario of drug development in an AKT1-mutant popula- 
tion (4% of all breast cancers), there is a need to screen 
around 10,000 patients with MBC to be able to perform a 
randomized clinical trial that includes 400 patients. 


Drug development in rare genomic segments 

To overcome the issue of patient accrual in a strati- 
fied medicine trial, we propose five possible solutions 
(Figure 4). First, a need to scale-up the number of patients 
screened for a genomic alteration exists. One possible 
approach consists of developing large molecular screening 
programmes to feed downstream therapeutic trials with 
patients presenting with a candidate genomic alteration. 
Two different types of molecular screening programmes 
for stratified medicine are available: basket trials and 
umbrella trials. The basket trials test the effect of a single 
drug ona molecular alteration in a variety of cancers. This 
design allows not only for a faster identification of candi- 
date patients, but also to assess the potential value of this 
targeted therapy across different tumour types. The most 
informative example for breast cancer is a clinical trial 
that examined the administration of neratinib in patients 
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cancer. This trial will screen 10,000 patients to identify 
those who are suitable for inclusion in one of five ran- 
domized registration trials. So far, an example of such a 
large trial does not exist in breast cancer as most of the 
molecular screening programmes are developed by single 
institutions for inclusion in their own phase I trials.*® The 
only two large molecular screening programmes avail- 
able are the previously mentioned SAFIRO1 trial for breast 
cancer patients” and the BATTLE trial for NSCLC,” both 
of them ina single tumour type. 

A second solution to circumvent low incidence of tar- 
getable alterations consists of clustering several genomic 
alterations as a single predictor. As an illustration, in a 
randomized trial testing the FGFR-inhibitor dovitinib, 
FGFRI, FGFR2 and FGF3 amplifications were clustered 
within a single population called “FGF-pathway amplified 
breast cancer’ Following a similar principle, BRCA1 and 
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Figure 4 | Overcoming the accrual challenges of stratified medicine approaches. 
Different potential solutions include scaling-up molecular screening by using either 
less invasive and more accessible methods, such as ctDNA, or creating large 
international screening programmes; the development of novel preclinical models; 
the use of the preoperative setting in drug development to validate potential 
biomarkers or to determine the biological effect of a drug; and grant drug approval 
based on phase II clinical trials in rare genomics segments with worse outcome. 
Abbreviations: BC, breast cancer; ctDNA, circulating tumour DNA. 


who have solid tumours with activating ERBB mutations, 
including EGFR, HER2 and HER3. Several distinct cancer 
cohorts are included in this study: bladder/urinary tract 
cancer, breast cancer, colorectal cancer, endometrial 
cancer, gastric/esophageal cancer, ovarian cancer, all 
other solid tumours with a HER2 mutation, EGFR 
mutated and/or amplified primary brain cancer, and solid 
tumours with a HER3 mutation (NCT01953926).** This 
trial will define whether patients with ERBB2-mutated 
breast cancer are sensitive to the pan-HER inhibitor 
neratinib. Preliminary data for the non-small-cell lung 
cancer (NSCLC) cohort showed that for the 13 patients 
in the trial who received neratinib monotherapy, no 
patient experienced a partial response (PR), 7 (54%) 
patients achieved stable disease (SD) and 4 (31%) patients 
achieved clinical benefit (defined as a PR or SD for 12 or 
more weeks).*° Other basket trials have been developed 
recently, but are less relevant for breast cancer, owing to 
the scarcity of the molecular alterations that are investi- 
gated. These trials include the evaluation of vemura- 
fenib in BRAFY'-positive cancers (NCT01524978)* or 
crizotinib in patients with molecular alterations in ALK, 
MET or ROS genes (NCT02034981).*” The umbrella trials 
assess the effect of different drugs in different molecular 
alterations either in one or several tumours. Two umbrella 
studies have been specifically dedicated to breast cancer, 
the SAFIRO1 and the AURORA trials. The SAFIRO1 trial 
included 403 patients that assigned 52 (13%) patients 
to a targeted therapy matched to an identified genomic 
alteration.» The AURORA trial is an ongoing molecu- 
lar screening programme that will include 1,300 patients 
with advanced-stage breast cancer to perform high cover- 
age NGS and RNA sequencing on matched primary and 
metastatic samples.** Other examples of molecular screen- 
ing programmes that include breast cancers among other 
disease subtypes are also available. The NCI-MATCH 


BRCA2-mutation carriers are usually clustered together in 
clinical trials testing PARP inhibitors.” 

The third approach is to perform most of the drug 
development in the preoperative setting. The drug devel- 
opment process usually includes several trials for proof- 
of-concept, dose optimization, biomarker validation and 
development of combination therapies. Overall early 
stage breast cancer represents >80% of the breast cancers 
detected, thus several trials could be performed in the pre- 
operative setting rather than in patients with metastatic 
disease, which would speed up the process. 

A fourth potential solution would be the use of novel 
preclinical models (such as patient-derived xenografts 
and ex-vivo circulating tumour cell [CTC] cultures). As an 
example, studies using ex vivo culture of circulating breast 
tumour cells for individualized testing of drug suscepti- 
bility have been performed,” including a trial in CTC cul- 
tures from six patients with ER-positive breast cancer that 
were tumorigenic in mice, allowing for genome sequenc- 
ing of the CTC lines but also drug sensitivity testing that 
revealed potential new therapeutic targets.” The use of 
CTCs would enable the use of genome sequencing to 
identify the presence of pre-existing or newly acquired 
molecular alterations in the tumour, and enables drug 
sensitivity testing to identify the best therapies for 
individual cancer patients. 

Furthermore, drug approval based on phase I/II trials 
could dramatically speed-up drug development, as 
shown by the example of ALK inhibitors in lung cancer.** 
This approach would require the initial identification of 
molecular subgroups associated with very poor outcome. 
ESRI mutations have been associated with very poor 
outcome in patients with MBC™ and could constitute a 
population where drugs could be approved based on the 
results of phase II clinical trials. Of note, this strategy is 
effective only ifthe genomic segment associated with poor 
outcome also defines a population that is sensitive to the 
investigated targeted therapy. 
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Feasibility of biopsies and genomic tests 

While accrual represents the biggest challenge to develop 
stratified medicine for patients with breast cancer, other 
logistical considerations need to be addressed. The most 
relevant one is that it is impossible to perform genomic 
tests in a significant proportion of patients, owing to 
difficult tumour (metastatis) locations (such as bone 
metastasis)** and the low percentage of cancer cells in 
some samples. Several approaches to overcome these 
limitations are available. Firstly, the analysis of ctDNA” 
could replace the use of biopsies when these cannot be 
done, or when the tumour site is not compatible with 
DNA extraction. However, there is lack of robustness to 
quantify copy-number aberrations using this techno- 
logy. Performing a biopsy might be an issue not only in 
patients with tumours in difficult-to-access locations, 
and it seems likely that the capacity of interventional 
radiology facilities will not permit biopsy sampling in 
all patients. Secondly, performing in-depth sequencing 
could enable analyses of samples with a low percentage of 
cancer cells, as shown in the MOSCATO trial, where the 
introduction of NGS (1,000) increased the feasibility of 
widespread genomic testing.'® 


Clinical utility of multigene profiling in MBC 

Stratified medicine is currently the dominant model in 
breast cancer research, although, its sustainability in the 
long term is questionable. Indeed, with the evolution 
of technologies and advances in biology, rarer genomic 
entities will be detected, leading to ever more challenges 
for patient accrual and will make stratified clinical trials 
unfeasible. A more-appropriate model to develop pre- 
cision medicine in patients with MBC could consist of 
evaluating the concept of personalized medicine, rather 
than stratified medicine. In this model, the hypothesis 
of clinical research is that the use of a multigene pro- 
filing will improve outcomes in the whole population 
(Figure 3b). These trials compare a standard treatment 
arm to an arm in which stratification is tested based on 
molecular profiling. In patients with breast cancer, the 
SAFIRO2 trial® is being conducted to compare mainten- 
ance chemotherapy to eight different targeted thera- 
pies given according to an individual’s genomic profile 
(identified by NGS and CGH arrays). This trial, therefore, 
evaluates whether the overall approach of personalized 
medicine improves outcome as opposed to a standard 
maintenance therapy. Several other studies are testing 
this strategy; most of them including multiple tumour 
types.*° The main advantage of this approach is that it 
does not need to accrue thousands of patients in order 
to address a research question. Nevertheless, several key 
concerns about trial design remain unsolved. The first 
one relates to the choice of the standard arm. Indeed, the 
best standard arm should theoretically include similar 
treatments to those used in the personalized medicine 
arm, but given in a random way, in order to have the 
same drugs both in the standard and the personalized- 
treatment arm. This strategy is adopted in the NCI- 
MPACT trial” testing efficacy of personalized medicine 
across tumour types, including breast cancers. The 


second question relates to whether the effect observed 
in the overall population is an effect that is common all 
the drugs tested or only a few drugs. Thus, several statis- 
tical tools need to be developed to exclude this bias, such 
as interaction tests to exclude the hypothesis that some 
treatments are better than others. The sample sizes of 
these trials could be calculated accordingly. 

Other logistical and operational challenges exist, 
but they are currently not considered as major limiting 
factors to developing precision medicine, including the 
fact that a genomic alteration cannot be identified in 
every patient. In this context, the use of a large panel of 
genes could maximize the likelihood of a patient receiv- 
ing a matched therapy. Also, developing alternative 
technologies to identify drivers, including gene expres- 
sion or protein-based assays, would certainly increase 
the number of patients for which a driver is identified. 
For example, the WINTHER trial is performing gene- 
expression arrays in patients for which no DNA alter- 
ation could be detected to compare it to gene expression 
in normal (not cancerous) tissue to identify a potential 
treatment of interest.*” The difficulty of access to appro- 
priate or available drugs is an important secondary issue. 
Indeed, in the SAFIRO1 study, only 28% of the patients 
with a genomic alteration had access to the targeted 
therapy,”° either because a clinical trial testing a specific 
targeted agent would not accept patients with breast 
cancer or owing to the absence of available treatment 
slots. These issues indicate that genomic tests should 
be performed only in the context of a drug access pro- 
gramme, either in phase I units or in consortium with 
secured drug access. Finally, the modality of implemen- 
tation of precision medicine should also be considered. 
Indeed, two models of precision medicine are avail- 
able. In one model, some biomarker companies would 
perform the test whereas in the other one, public centres 
will conduct them for free. As an illustration of the latter, 
the French National Cancer Institute has set-up a group 
of 28 genomic centres located in academic hospitals to 
offer free and equal access to genomics for all patients 
with cancer.* 


Conclusions 

The concept of precision medicine driven by genomics 
for MBC is appealing; however, no evidence exists that 
it can improve patient outcome. This failure is a result of 
both scientific and logistical issues. The biggest issue in 
the field of breast cancer is the lack of well characterized 
drivers, with the exceptions of ER, ERBB2, PIK3CA and 
AKT1. Developing catalogues of cancer-related genes, 
together with methods for assessments of pathway acti- 
vation could enable a better identification of such drivers 
in the near future. The characterization of the genomic 
landscape of tumours and of the activated protein 
network will guide combination therapies to optimize 
therapeutic effects. Finally, logistics and operational 
challenges need to be addressed. In this regard, a shift 
towards personalized medicine rather than a stratified 
medicine approach could improve the management 
of patients. 
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BCLTIA is a triple-negative breast cancer gene with 


critical funct 


ions in stem and progenitor cells 
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Triple-negative breast cancer (TNBC) has poor prognostic outcome compared with other 
types of breast cancer. The molecular and cellular mechanisms underlying TNBC pathology 
are not fully understood. Here, we report that the transcription factor BCLTIA is overexpressed 
in TNBC including basal-like breast cancer (BLBC) and that its genomic locus is amplified in 
up to 38% of BLBC tumours. Exogenous BCL1JA overexpression promotes tumour formation, 
whereas its knockdown in TNBC cell lines suppresses their tumourigenic potential in 
xenograft models. In the DMBA-induced tumour model, Bcllla deletion substantially 
decreases tumour formation, even in p53-null cells and inactivation of Bcl7la in established 
tumours causes their regression. At the cellular level, Bc/1/a deletion causes a reduction in the 
number of mammary epithelial stem and progenitor cells. Thus, BCL11A has an important role 
in TNBC and normal mammary epithelial cells. This study highlights the importance of further 
investigation of BCLTIA in TNBC-targeted therapies. 
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heterogeneous nature of the disease'. TNBC accounts for 

around 15% of all breast cancer cases and in the absence 
of effective targeted therapies, TNBC patients tend to have a poor 
prognosis*~*. At the molecular level, several distinct subtypes of 
breast cancer have been identified based on the gene expression 
profiling>>°. The most commonly used classification describes 
six subtypes: luminal A, luminal B, Her2, claudin low, basal-like 
breast cancer (BLBC) and normal*®. More recently, analysis of 
large numbers of tumour samples as part of the METABRIC 
study identified 10 pathologically distinct subtypes known as 
integrative cluster (IC) 1-10 (ref. 5). The majority of TNBC cases 
(80%) have a BLBC’ or IC10 (ref. 5) gene expression signatures. 
In addition, cancer sequencing studies have identified mutations 
of p53, PTEN and BRCAI in TNBC**®?. However, driver 
oncogenic genomic aberrations in TNBC have not been 
comprehensively identified. 

The developmental hierarchies of the mammary epithelium 
and hematopoietic lineages share many similarities!? in that stem 
cells progressively give rise to lineage-restricted progenitors, 
which ultimately differentiate and generate all functional cells. 
A number of key hematopoiesis transcription factors have 
important roles in mouse mammary gland development and 
are human breast cancer genes!!!5. For example, the key 
regulator of T-helper-2 cell development, GATA3, is critical in 
luminal mammary cell development!”!° and is a luminal breast 
cancer marker gene!®. In this study we interrogated cancer 
genomics data focusing on a subset of important hematopoiesis 
factors and identified BCL11A as a novel TNBC oncogene. 


O ne of the major challenges in treating breast cancer is the 


Results 

BCLIIA is highly expressed in triple-negative breast cancer. In 
an attempt to identify potential TNBC oncogenes, we selected a 
list of genes known to have important roles in hematopoiesis 
and investigated their expression across the major molecular 


subtypes of breast cancer’, We first re-analysed a publically 
available microarray data set® and found that out of the 
examined genes, BCL11A was differentially and highly 
expressed in BLBC (Supplementary Fig. la). This is in sharp 
contrast to GATA3, which is highly expressed only in luminal 
subtypes (Supplementary Fig. la) and is a known prognostic 
marker for these tumours !°. 

We then investigated the expression of BCL11A in other 
patient data sets including METABRIC® and TCGA®, which 
between them have curated gene expression, copy number (CN) 
variation and clinical data from close to 3,000 patients”. 
Pathologically, we found that high BCL11A_ expression 
significantly correlated with TNBC pathology (Fig. la). At the 
molecular level, high BCL11A expression was also found to 
significantly correlate with the BLBC subtype in the METABRIC, 
TCGA and six other microarray data sets (Fig. 1b and 
Supplementary Fig. 1b). Quantitative reverse transcription PCR 
(qRT-PCR) analysis of BCL11A expression on a randomly 
selected subset of METABRIC tumours (all subtypes, n= 230) 
validated the above expression data (Supplementary Fig. 2a). In 
addition, we also found that high BCL11A expression in 
METABRIC samples correlated with the recently described 
IC10 cluster of tumours (Fig. 1c), thus further supporting the 
concordance between the BLBC and IC10 classifications. 
Consistent with TNBC cases, high BCLI1A_ expression 
was significantly correlated with a high histological grade 
(Supplementary Fig. 2b). 

Furthermore, high BCL11A expression in BLBC cases was 
further validated by immunohistochemistry (IHC) on a subset of 
the METABRIC tumours (all subtypes, n = 368. BLBC, n= 24). 
Strong BCL11A immunostaining was predominantly found in 
BLBC (Fig. 2a). Out of 24 BLBC samples examined from this 
subset, 16 scored positive for BCLI1A (Fig. 2a; details in 
Methods). In addition, samples stained positively in IHC also 
had higher RNA levels compared with those scored as negative 
(Supplementary Fig. 2c). 
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Figure 1 | BCL11A is highly expressed in TNBC. (a) Significant correlation between BCL7IA expression and the TNBC type of breast cancer in both 
METABRIC (n= 2,000) and TCGA (n=1,100) data sets—* indicates t-test P value<0.005. (b) BCLIIA expression across the six molecular subtypes of 
breast cancer (‘Normal’ refers to the PAM50 subtype) in both METABRIC and TCGA data sets—* indicates t-test P value< 0.005. (¢) The METABRIC 
samples distributed according to the ICs 1-10, showing the correlation between BCLTIA expression and IC10—* indicates t-test P value<0.005. 


2 NATURE COMMUNICATIONS | 6:5987 | DOI: 10.1038/ncomms6987 | www.nature.com/naturecommunications 


© 2015 Macmillan Publishers Limited. All rights reserved. 


NATURE COMMUNICATIONS | DOI: 10.1038/ncomms6987 


ARTICLE 


% Tumours positive 


13/58 


BCL11A IHC on METABRIC samples 


[29% | 8/27 


* HER2 


[io 13/133 fi5%4] 14/94 [25% | 8/32 


Lum B ~*~ Normal 


b c BCL11A copy number gains d 
407 METABRIC - PAM50 50 7 TCGA - PAM50 a 
= METABRIC integrative clusters 
* 
30 40 
2 2 2 
=I 5 30 5 
fo} fe} fe} 
5 20 5 5 
x se 20 x 
10 40 
0 0 
> oe Y e > > IC1 Ic2 IC3 C4 IC5 IC6 IC7 IC8 IC9 IC10 
& CS Ss & 
e f 9g 
METABRIC. METABRIC 
TCGA BCL11A expression BCL11A CN gains 
All tumours Basal breast-specific survival breast-specific survival 


O Homdel 
O Hetdel 
O Diploid 
O Gain 
O Amp 


O Homdel 
© Hetdel 
O Diploid 
O Gain 
O Amp 


BCL11A expression 


BCL11A methylation 


P=0.02 P=0.02 Neutral 


Normal exp 


— Highest 15% — Gain 


100 150 200 250 300 
Months 


0 50 100 150 200 250 300 0 


Months 


50 


Figure 2 | Genomic alterations at the BCL11A locus. (a) Images and scoring of BCLI1A IHC staining on a subset of tumours from the METABRIC study 
(see Methods for scoring)—* indicated y?-test P<0.0001. (b-d) Bar chart depicting the percentage of samples that harbour BCL7IA CN gains in each 
subtype in both METABRIC and TCGA data sets—* indicates 72-test P< 0.0001. (e) Scatter plots showing the methylation status of the BCLT7A locus in all 
tumours or basal only from the TCGA data set. The colour-coded legend indicated the BCL11A CN status for each tumour. (f-g) Kaplan-Meir plots showing 
the survival rate comparison between patients who have normal or high levels of BCL17A expression, or between patients with CN gains or without 


(neutral). 


One mechanism for the induction of high BCL11A expression 
in BLBC cases could be CN aberrations. From ~2,000 
breast cancer cases in METABRIC?, CN gains at the BCLI1A 
genomic locus were identified in 62 patients (Supplementary 
Fig. 3a), which also correlates with high BCL11A expression 
(Supplementary Fig. 3b). Importantly, out of these 62 patients 
with CN gains, 39 were classified as BLBC, which account 
for 18.6% (39/210) of the total BLBC cases in METABRIC 
(Fig. 2b). Examination of the TCGA data set revealed that 38% 
(31/81) of BLBC samples have BCL11A CN gains, which is 
again significantly correlated with higher gene expression (Fig. 2c 
and Supplementary Fig. 3c). A similar result was also found 
when the METABRIC data was analysed using the integrative 
clustering, with 15.6% of IC10 samples having BCL11A CN 
gains (Fig. 2d). 


Further analysis of the TGCA data set revealed that in BLBCs, 
the BCL11A locus is almost exclusively hypomethylated and this 
is correlated with high expression levels (Fig. 2e). There was also 
no correlation between BCL11A CNs and the methylation status. 
This result suggests that epigenetic changes at the BCL11A locus 
could be another mechanism that contributes to its high 
expression in BLBC. Given the strong correlation with TNBC, 
patients with either high expression or CN gains of BCL11A 
had poor survival rates compared with the rest of the cohort 
(Fig. 2f, g). A similar trend was also observed in four other patient 
data sets!7-7° (Supplementary Fig. 3 f-i). In particular, patients 
with CN gains of BCL11A had a higher rate of relapse and 
metastasis and a lower rate of survival (Supplementary Fig. 3d-e). 
The utility of BCL11A expression/CN as a biomarker in the clinic 
thus warrants further investigation. Indeed, the future release of 
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patient outcome for the complete TCGA cohort will aid in 
clarifying this finding. 


High levels of BCL11A promote tumour development. 
Although BCL11A is involved in rare B-cell lymphomas and is 
able to transform fibroblast cells in vitro”!??, the cellular and 
molecular mechanisms of BCL11A-mediated tumourigenesis 
remains unclear. To address this, we first tested whether 
BCLI11A overexpression could promote the colony formation or 
tumour development in mammary epithelial cells. We over- 
expressed BCL11A in immortalized non-tumourigenic mouse 
EpH4 (ref. 23) or human HMLE*** cells (Supplementary Fig. 4a) 
and performed Matrigel and suspension mammosphere assays. 
Forced BCL11A expression in both EpH4 and HMLE (EpH4-11A 
and HMLE-11A) cells resulted in double the number of spheres 
compared with their respective control cells (Fig. 3a-b). 
Furthermore, mouse EpH4-11A cells injected orthotopically in 
cleared mammary fat pads of immune-compromized NOD/SCID/ 
IL2ry ~/ — (NSG) mice formed larger and palpable tumours 
compared with control cells (n= 6) (Fig. 3c and Supplementary 
Fig. 4b). Similarly, three out of four mice injected with HMLE-11A 
cells developed tumours within 8 weeks of injection (Fig. 3d and 
Supplementary Fig. 4c) suggesting that elevated levels of BCL11A 
promote tumour development. Moreover, gene expression 
analysis of these three tumours along with the 2,000 tumours 
from the METABRIC study clustered them with the BLBC 
subgroup (Fig. 3e). 


Knockdown of BCL11A reduces tumourigenicity of TNBC 
cells. Analysis of BCL11A expression in a panel of breast cancer 
cell lines revealed that BCL11A is highly expressed in TNBC lines 
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but is undetectable in any of the luminal cell lines tested 
(Supplementary Fig. 5a). Next, we assessed if disrupting BCL11A 
expression could affect the clonogenic and oncogenic potential of 
the TNBC cell lines. To inactivate BCL11A in these cells, we 
performed shRNA knockdown experiments (Supplementary 
Fig. 5b) in the TNBC cell lines 4T1 (mouse), MDA231, 
SUM159 and HMLER (human). Knockdown of BCL11A had no 
significant impact on cell viability, cell cycle kinetics or cell death 
(Fig. 4a-c and Supplementary Fig. 5b,d). However, BCL11A 
knockdown significantly reduced the clonogenic capacity of all 
four cell lines (Fig. 4d-f and Supplementary Fig. 5c). To assess 
tumourigeneic potential, BCL11A knockdown cells were injected 
subcutaneously into NSG recipients. Robust tumours developed 
from the control 4T1, MDA231, SUM159 and HMLER cells 
within 25 days. In contrast, the BCL11A knockdown cells pro- 
duced tumours of significantly reduced sizes (Fig. 4g-i and 
Supplementary Fig. 5c). Furthermore, primary and secondary 
limiting dilution transplantations of MDA231 control or shRNA1 
cells revealed a reduction in the number of tumour-initiating 
cells during the secondary transplants from 1/123 to 1/667 
(Supplementary Fig. 5e). 


Bcll11a is required for the development of DMBA tumours. To 
examine the role of BCL11A in mammary tumour development 
in vivo, we generated Bcllla conditional knockout (cko) mice 
(referred to as flox/flox; Supplementary Fig. 6a), as germline 
deletion of Bcl1 1a causes neonatal lethality?” and crossed them to 
the inducible Rosa26-CreERT2. As a tumour model, we used the 
potent carcinogen DMBA (7,12-dimethylbenz(a)anthracene) in 
combination with medroxyprogesterone acetate (MPA) to 
promote TNBC-like tumours in the mouse”®??. To minimize 
the effects of Bcllla deletion on non-mammary tissues, we 
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Figure 3 | High levels of BCL11A enhance clonogenicity of mammary epithelial cells and promote tumourigenesis. (a) Comparison of colony numbers in 
matrigel from EpH4-11A cells or from the control cells. Data are presented as meant s.d. (n= 3). Image on the right are depicting EpH4-control and EpH4- 
11A mammospheres grown in Matrigel (scale bar, 100 um). P value indicates student's t-test. (b) Comparison of the number of floating mammospheres 
formed from human HMLE-11A cells or the control cells. Data are presented as mean +s.d. (n= 3). Images on the right are of floating mammospheres 
formed by HMLE-control and HMLE-11A-expressing cell (scale bar, 200 um). P value indicates student's t-test. (¢) Graph depicting the size difference 
between tumours at 6 weeks after injection of EpH4-control and EpH4-11A cells orthotopicaly into contralateral mammary fat pads. Data are presented as 
mean +s.d. (n= 6). P value indicates student's t-test. (d) Kaplan-Meier survival curve depicting the percentage of tumour-free mice injected with either 
HMLE-control or HMLE-11A cells (n= 4). (e@) Unsupervized clustering of the HMLE-11A tumours in the mouse with human tumours from the METABRIC 
study based on the PAMSO (ref. 3) gene expression. Nearest centroid correlation score is plotted against the various subtypes for all three tumours. 
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Figure 4 | BCL11A knockdown in TNBC cells reduces tumour development. (a) Western blot showing BCL11A knockdown efficiency in 4T1, SUM159 and 
MDA231 cells transfected with control (scramble), shRNA1 or shRNA2 vectors. (b) MTS cell viability assay (see Methods) shows that BCL11IA knockdown 
does not affect cell viability in culture. (¢) EdU cell cycle analysis showing that Bcl1la knockdown does not affect cell cycle kinetics in all tested cell lines. 
Data are presented as mean +s.d. (n= 3). (d-f) Comparison of colony numbers from control, shRNA1 and shRNA2 in 471, SUM159 and MDA231 cells. 
Data are presented as mean +s.d. (n= 3). P value indicates student's t-test. (g-i) Graph depicting the reduction in tumour size observed when shRNAI1 or 
shRNA 2 transfected 4T1, SUM159 or MDA231 cells are injected subcutaneously into mice compared with control. Image under graph shows the actual 


tumours measured (scale bar, 5mm). Unpaired T-test was performed on d- 


transplanted mammary tissue from 8- to 12-week-old control 
(wild type) or flox/flox virgin female mice into contralateral 
cleared fat pads of female NSG mice followed by DMBA muta- 
genesis as illustrated in Supplementary Fig. 6b. By week 15, after 
the last dose of DMBA was administered, palpable tumours were 
visible in the mammary glands engrafted with the control 
mammary cells, but not with the flox/flox cells (Fig. 5a). By week 
22 post DMBA treatment, all control cell engraftments (8/8) 
developed tumours compared with only one from flox/flox 
mammary cells (1/8) (Fig. 5b). qRT-PCR analysis of this tumour 
revealed expression of Bcllla probably owing to incomplete 


i and * indicates P<0.05 and ** indicates P< 0.005. 


Cre-loxP recombination (Supplementary Fig. 6c, sample T1). 
Also, qRT-PCR and IHC results revealed that tumours 
upregulated Bcllla expression in response to DMBA-induced 
carcinogenesis (Supplementary Fig. 6c-d). These data thus 
reveal a requirement for Bcllla in DMBA-induced mammary 
tumourigenesis. 

To investigate Bcllla oncogenic activity in the DMBA model 
further, we performed the DMBA mutagenesis experiment using 
Trp53flox/flox*® (p53 single cko) or Bcl1laflox/flox/p53flox/flox 
(cko alleles for both p53 and Bellla or Dflox/flox) mammary 
tissues. In the recipients transplanted with Trp53flox/flox cells, 
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Figure 5 | Bcl11a is required in DMBA-mediated tumourigenesis. (a) Mouse images and fat pads whole-mount fat pads of either WT, flox/flox, pS3flox/ 
flox or Dflox/flox mammary cells. (scale bar, 500 um) (b) Quantification of the tumours with DMBA-mediated tumour development from the four depicted 
groups of engrafted cells over a period of 26 weeks. A log-rank (Mantel-Cox) test was used to compare the two groups and calculate the P value. 

(¢) Tumour size quantification of DMBA-mediated tumours in WT, +/flox and flox/flox mice. Mice were checked regularly for tumour development and 
once detected, Cre activation was induced using three injections of tamoxifen (first day of injection is indicated by black arrow). Tumours size was then 


monitored for up to 20 days or until they reach a critical size. 


palpable tumours were detectable as early as 4 weeks after the last 
injection of DMBA, and most tumours were detectable by week 
10 (Fig. 5b; n= 16). However, deletion of Bcllla together with 
p53 in Dflox/flox mice severely delayed tumour development with 
only 4 out of 16 mice developeing tumours by week 17 (Fig. 5b). 
This result indicates that BCL11A is a potent oncogene and is 
required in concert with p53 for tumour development. 


Bcllla is required for the maintenance of DMBA tumours. 
Although Bcllla is important for DMBA-induced mammary 
tumour formation, it is more clinically relevant if it has functions 
in mammary tumour progression and maintenance. We thus 
performed the DMBA mutagenesis on WT, flox/+ and flox/flox 
mammary epithelial cells before the induction of Bcl11a deletion. 
Only when mammary tumours were detected and measured, the 
mice were then injected with tamoxifen to induce Bcll1a deletion. 
As shown in Fig. 5c, deletion of Bcllla caused a significant 
reduction in tumour size as soon as 5 days post deletion. On 
contrary, tumours from the control heterozygous donor cells 
continued to grow post tamoxifen injection (Fig. 5c). The 
requirement of Bcllla in the established mouse mammary 
tumours is consistent with the decreased tumourigensis of 
BCL11A knockdown breast cancer cells and underscores its 
candidature for therapeutic development. 


Bcllla is required for mammary stem and progenitor cells. 
To understand the biological function of Bcllla in healthy 
mammary epithelial cells, we generated a Bcllla-lacZ knock-in 
mouse to determine the temporal and spatial expression of 
Bcllla in the mammary gland (Supplementary Fig. 7a). X-gal 
staining of the reporter embryos revealed that Bcllla was 
expressed in the mammary placodes from 12.5dpc (Fig. 6a). 
At puberty, Bcll1a was expressed in the cap cells of the terminal 
end buds, a region thought to harbour stem cells*! (Fig. 6b). 
During adult mammary gland development, Bcllla exhibited a 
dynamic expression pattern with a marked increase at early 
gestation and a gradual decline towards lactation and involution 
(Supplementary Fig. 7b). qRT-PCR analysis of RNA samples 
from several mammary epithelial compartments**? detected 
higher levels of Bcllla expression in the luminal progenitors 
(CD49b* /CD24"'), the basal cells (CD49F"/CD24*) and the 
mammary stem cell (MaSC) (CD49Fhi/CD24™4)_ enriched 
population (Fig. 6c). 

We next induced Bcll1a deletion and analysed the mammary 
epithelial fluorescence-activated cell sorting profile 3 weeks post 
deletion. The basal mammary epithelial cells from the flox/flox 
mice appeared to be depleted, and in particular the MaSC fraction 
(Fig. 6d). In addition, Bcllla deletion caused a significant 
decrease in the number of luminal colony-forming cells (CFCs) 
(Supplementary Fig. 7c). To functionally demonstrate loss of 
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Figure 6 | Expression and critical roles of Bcl1la in mouse MaSCs and progenitors. (a,b) X-gal staining of a 12.5dpc embryo and whole mount of the 
mammary gland from a 5-week-old Bclt1a-!°/+ female virgin mouse. The dashed circles highlight the mammary placodes and the terminal end buds. LN: 
lymph node. (scale bar, 500 um) (e) qRT-PCR for Bcl71a in different mammary epithelial cell compartments that were fluorescence-activated cell sorting- 
purified using antibodies for CD24, CD49f and CD49b. Data are presented as mean +s.d. (n= 3) and t-test was performed and the P values are displayed 
on the plot. (d) Depletion of the MaSCs-enriched population (cD24™e4cD49f"!, dashed lines) in the BclTla-deficient mammary gland detected by flow 
cytometric analysis. (e) Limiting dilution transplant (fat pad) assay showing severely compromized engraftment of Bcl7la-deficient MaSCs. Stem cell 


frequency calculation is described in the Methods. 


MaSC activities upon Bcllla deletion and to determine that the 
defects are cell-autonomous, we transplanted control and flox/flox 
cells at limiting dilution into cleared fat pads of NSG mice (see 
Methods). We found approximately sixfold reduction in stem cell 
frequency from 1/483 to 1/2859, in the Bcllla-deficient 
mammary gland (Fig. 6e). Reduction of MaSCs and progenitors 
in the Bcl11a-deficient mammary gland was also reflected in the 
altered expression of the MaSC gene expression signature** 
(Supplementary Table 1) (Supplementary Fig. 7e). 


Discussion 

We have demonstrated here that the transcription regulator 
BCL11A is a novel breast cancer gene. By investigating cancer 
genomics data from ~ 3,000 patients (METABRIC and TCGA), 
BCL11A was significantly expressed at higher levels in TNBC and 
particularly in BLBC/IC10 tumours both at RNA and protein 
levels. Experimentally, we have shown that disrupting BCL11A 
expression in TNBC cell lines and in the mouse significantly 
reduced tumour development and maintenance. At the cellular 
level, Bcllla is expressed and required in both MaSCs and 
luminal progenitor cells in the mammary gland. Lineage tracing 
experiments in the future will determine if Bcl11a is expressed in 
the recently identified lineage-restricted luminal and_ basal 
progenitor cells*> or in the bipotent MaSCs*°. Importantly, 
given the recent implication of luminal progenitors as the ‘cell of 
origin’ of BLBC*”%, it will be important to ascertain if Bcllla 


upregulation in luminal progenitor cells is one of the earliest steps 
in TNBC development. 

In addition, it will be important to identify how BCL11A is 
transcriptionally regulated and what are its downstream targets in 
TNBC. In erythrocytes, KLF1 has been shown to affect BCL11A 
expression*’, while in non-small cell lung cancer MIR30A has 
been suggested to regulate BCL11A expression*?. We found no 
correlation between KLF1 or MIR30A and BCL11A expression in 
the TCGA data set (Supplementary Fig. 8), suggesting that 
BCLI1I1A regulation could be context dependent. In terms of 
downstream targets, in leukaemia, it has been shown that 
BCL11A abrogates p21 transcription possibly via direct 
regulation of SIRT1 (refs 41,42). Previous work from our lab 
also showed that in B cells, BCL11A induces MDM2 expression, 
which is a negative regulator of p53 (ref. 43). However, the TCGA 
data does not indicate a strong correlation between BCL11A and 
SIRT1 or MDM2 expression at least in the tumour context 
(Supplementary Fig. 8). Therefore, identifying the putative 
BCLI1I1A regulators and its downstream targets in the breast 
epithelial cells should clarify its molecular and cellular roles in 
TNBC. 

In conclusion, through cancer genomics, in vitro assays, 
experimental xenograft models and mouse genetics, we have 
demonstrated in this study that BCL11A is a new breast cancer 
gene and a critical regulator in normal mammary epithelial 
development. These results warrant further investigation of 
BCLIIA as a potential candidate for TNBC-targeted therapy. 
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Methods 

Mouse strains and breeding. All experimental animal work was performed in 
accordance to the Animals (Scientific Procedures) Act 1986, UK and approved by 
the Ethics Committee at the Sanger Institute. Bcll1a bacterial artificial chromo- 
somes (BACs) were identified from the 129/SvJ mouse BAC library (Sanger 
Institute) and used to generate the Bcllla-lacZ- and Bcllla cko-targeting vectors. 
For Bcllla-lacZ reporter, targeting construct (Supplementary Fig. 7a) was 
generated based on the recently published strategy“. For the Bell1a cko mouse, 
targeting construct (Supplementary Fig. 6a) was generated based on the 

original recombineering strategy*’. Gene targeting in embryonic stem (ES) cells 
and chimera production were performed according to the standard procedures. 
The Bcllla cko line was then crossed to the Rosa26-Cre-ERT2 mouse line 
described previously*®. The p53 cko line was described previously*”. Homozygous 
p53 cko mice were crossed to the Bcll1a/Cre-ERT line described above and the F1 
generation was mated to generate mice doubly conditional for Bcllla and p53. 
Genotyping was confirmed using the primers listed in Supplementary Table 2. Cre 
activation was mediated by three injections of 1 mg tamoxifen per mouse over 3 
days. 


Mammary epithelial cell isolation and analysis. Mammary epithelial cells were 
dissociated using a mixture of collagenase (Roche) and hyaluronidase (Sigma), and 
cells were stained using the following primary antibodies: biotinylated anti-CD45 
(clone 30-F11; eBioscience, 1:500), anti-Ter119 (clone Ter119; eBioscience, 1:500) 
and anti-CD31 (clone 390; eBioscience, 1:500); anti-CD24-R-phycoerythrin (PE; 
clone M1/69, eBioscience, 1:500), anti-CD49f-Alexa Fluor 647 (AF647; clone 
GoH3, eBioscience, 1:100), anti-CD49b-Alexa Fluor 488 (AF488; clone HMa?2; 
eBioscience, 1:500) and Scal-Alexa Fluor 647 (AF647; clone D7, eBioscience, 
1:500). Secondary antibodies used: Strepavidin-PE-Texas Red (PE-TR; Molecular 
Probes, 1:500). Apoptotic cells were excluded by elimination of propidium iodide- 
positive cells. Flow cytometric analysis was done using CyAN ADP (DakoCyto- 
mation) and all sorts were performed using MoFlo (DakoCytomation) and gates 
were set to exclude > 99.9% of cells labelled with isoform-matched control anti- 
bodies conjugated with the corresponding fluorochromes. For whole-mount ana- 
lysis, abdominal glands (no. 4) were spread out using forceps on a glass slide and 
incubated in Carnoy’s fixative overnight. The slide was washed in water and placed 
in carmine alum (Sigma) stain overnight. The slide was again washed with ethanol 
and cleared in Xylene for 1 day before documentation. For histological analysis, 
abdominal glands were fixed in 4% formaldehyde in PBS for 24h at room tem- 
perature. The glands were transferred to 70% ethanol and stored at — 20°C until 
embedding and sectioning. All tissues were embedded in wax and sectioned at 

5 tm before being stained with haematoxylin and eosin. 


Mammary CFC assay. For colony-forming assays, the medium used was (human) 
NeuroCult NS-A Proliferation Medium (StemCell) supplemented with 5% 

fetal bovine serum, 10g ml~! epidermal growth factor (Sigma), 10 ng ml ~ ! 
basic fibroblast growth factor (Peprotech) and N2 Supplement (Invitrogen); the 
cultures were maintained at 37 °C/5% CO) for 7 days; then fixed using ice-cold 
acetone/methanol (1:1) and visualized using Giemsa staining (Merck). Lin~ 
CD24>1CD49b+ luminal progenitors from the flox/flox mammary gland were 
sorted and plated with irradiated feeders in colony-forming assay medium for 

6 days before the number of mammary CFCs was enumerated. 


Transplantation of mammary epithelium. Mammary epithelial cells (basal 
fraction) from tamoxifen-injected and non-injected flox/flox or flox/+ mice were 
sorted based on CD24/CD49f and transplanted in limiting doses (500/750/1,000/ 
2,000 cells) into cleared fat pads of 3-week-old NSG females. In each case, non- 
injected and tamoxifen-injected epithelial cells were engrafted into contralateral 
glands of the same recipient mice. The recipient mice were impregnated 3-6 weeks 
after transplant and outgrowths produced were dissected, stained with carmine and 
scored. Stem cell frequency was calculated using L-Calc (StemCell Technologies). 


DMBA/MPA tumourigenesis protocol. Mammary fragments were transplanted 
into cleared fat pads of 3-week-old NSG mice. At the time of surgery, the MPA 
slow release pellet (Innovative Research of America) was also implanted sub- 
cutaneously. The mice were allowed to recover for 2 weeks and then Bellla 
deletion was induced using three injections of tamoxifen. One week after deletion 
of Bcllla, 1mg of DMBA (Sigma) was administered orally; this was followed by 
three further doses of 1 mg of DMBA over 3 weeks. Mice were then examined 
weekly for tumour incidence and killed when tumours reached the legal limit. 


Transfection and mammosphere assays. EpH4 (gift from Professor Christine 
Watson) and MDA231 (ATCC) cells were cultured to confluence in 1:1 DMEM:F12 
(Invitrogen) media containing 10% fetal calf serum (FCS; FetalclonelII, Clonetech). 
4T1 (ATCC) cells were cultured in Roswell Park Memorial Institute (RPMI) media 
(Invitrogen) containing 10% FCS, and SUM159 cells (gift from Dr Charlotte 
Kuperwasser) were cultured in Ham’s F12 (Sigma), 5% FCS, insulin (5 1g ml " 
Sigma) hydrocortisone (1 pg ml~}, Sigma) and 1 x Penicillin Streptomycin Gluta- 
mine (PSG) (Gibco). HMLE and HMLER cells (gift from Professor Robert Weinberg) 


were cultured in complete HuMEC media (Invitrogen). The control or the Bell la 
overexpression piggyBac vectors were delivered into cells using the Amaxa Basic 
Nucleofactor Kit for primary mammalian epithelial cells (Lonza) according to the 
manufacturer’s recommendations. Transfected cells were maintained at 37 °C/5% 
CO, for 48 h. Cells were then cultured in puromycin (1-5 jg ml ~ +) for 48 h to allow 
for selection. To induce BCL11A expression in EpH4 and HMLE cells, doxycycline 
(Clonetech) was used at a final concentration of 1.0 1g ml~ !. Floating or Matrigel- 
embedded mammosphere were cultured and passaged as previously described?” in 
ultra-low attachment plates (Corning). 


RNA knockdown. BCL11A shRNA sequences were obtained from the TRC con- 
sortium*® (TRCN0000033449 and TRCN0000033453) were cloned into piggyBac 
transposon vector (PB-H1-shRNA-GFP). 4T1, SUM159, MDA231 and HMLER 
cells were transfected with 4.0 1g of piggyBac vector using Amaxa Basic 
Nucleofactor Kit for primary mammalian epithelial cells (Lonza) and GFP* cells 
were sorted/analysed 24-48 h later. 


RNA extraction and real-time PCR analysis. RNA from sorted cells was 
extracted using PicoPure RNA isolation kit (Molecular Devices) according to the 
manufacturer’s instructions. RNA from mammary tissue and cell lines was 
extracted using Tri-Reagent (Invitrogen) according to the manufacturer’s 
instructions. Complementary DNA (cDNA) was synthesized from 1 to 2 1g of total 
RNA using the Transcriptor Reverse Transcription cDNA Synthesis Kit (Roche). 
RT-PCR was performed using Hi-Fidelity Extensor mix (Thermo) using primers 
listed in Supplementary Table 2. Quantitative real-time PCR detection of cDNA 
was performed using SYBR Green Master Mix (Sigma, ABI and Invitrogen) 
according to supplier’s recommendations. The real-time PCR reactions were run 
in ABI-7900HT (Applied Biosystems) in triplicate. Primers used for real-time 
PCR on mouse samples were designed using PrimerBank* website (http:// 
pga.mgh.harvard.edu/primerbank/) and listed in Supplementary Table 2. All 
primers were purchased from Sigma-Aldrich. For real-time PCR on human 
samples Taqman gene expression probes (Life Technologies) were used. 


Cell cycle analysis. A total of 150,000 control or BCL11A knockdown cells were 
seeded in six-well plates and allowed to recover for 48 h. Cells were then incubated 
with 51M Edu (Invitrogen) for 1h. Cells were fixed and assayed using the EdU 
flow cytometery detection kit (Invitrogen) following the manufacturer’s 
instructions. 


Annexin v assays. A total of 100,000 control or BCL11A knockdown cells (in 
triplicates) were seeded in six-well plates and allowed to recover for 48 h. Cell were 
then collected and stained using the Annexin-V-AF647 (BioLegend) following the 
manufacturer’s instructions, and cells were then quantified using fluorescence- 
activated cell sorting. 


Cell viability assay. A total of 1,000 control or BCL11A knockdown cells (in 
triplicates) were seeded in 96-well plates and allowed to recover for 48 h. Cells were 
then incubated with CellTiter Aqueous One Solution (Promega) for 4h following 
the manufacturer’s instructions. Absorbance was then measured at 490 nm using a 
plate reader (Bio-Rad). 


Western blotting and IHC. Protein samples were prepared as described pre- 
viously!4 and probed using anti-Bcllla (Abcam, Clone 14B5, 1:1000) and Actin 
(Cell Signalling, 1:10000). For IHC analysis, BCL11A (Abcam (14B5, 1:50); CK14 
(Abcam; 1:100) and ERa (SCBT; 1:50) were used. Staining was detected using 
AF488- or Cy3-conjugated secondary (Sigma) and bisbenzimide-Hoechst 33342 
(Sigma). Fluorescence microscopy was carried out using a Zeiss Axiophot 
microscope equipped with a Hamamatsu Orca 285 camera, with images visualized, 
captured and manipulated using Simple PCI 6 (C Imaging Systems). The 
hematoxylin- and eosin-stained samples were visualized on a LEICA light 
microscope, while the mouse mammary gland whole mounts were visualized using 
the LEICA MZ75 light microscope. 


Microarray analysis. The intensity value for each probe set was calculated and the 
average of each gene was computed before the data analysis. For the quality control 
(QC) step, a set of intensity value of control genes were examined. All data were 
normalized and scaled by Partek Genomic Suite 6.4. Principal components analysis 
was performed to show the distribution of samples, eliminating outliers. Differ- 
entially expressed genes were selected by one-way analysis of variance by the factor 
of KO versus wild type, P value < 0.08. Hierarchical clustering of selected genes was 
performed to show the expression pattern. The resulting genes then underwent a 
pathway analysis (GeneGO: http://www.genego.com) to determine the biological 
significance of the data. 


Xenograft tumourigenesis assays. One hundred thousand EpH4, HMLE, 4T1, 
MDA231, SUM159 or HMLER cells were suspended in 25% Matrigel (BD 
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Innovative proton beam therapy for 
breast cancer and pancreatic cancer: the 


Medipolis experience 


he Medipolis Proton Therapy and 

Research Center (MPTRC, www. 

medipolis-ptrc.org/english/about. 
html) (Figure 1) is part of Medipolis Ibusuki, a 
project developed jointly by the government, 
industry and academia in Ibusuki, a city 
located in Kagoshima Prefecture, Japan. The 
aim of Medipolis lbusuki is to share cutting- 
edge medical knowledge with the rest of the 
world, while offering holistic treatment that 
aims to support every aspect of our patients’ 
recovery in a setting that is among the best 
of Japan’s hot spring resorts with a hotel and 
sports facilities. 

MPTRC, the core medical facility of 
Medipolis lbusuki, first offered proton beam 
therapy for people with cancer in 2011. It is 
now developing new proton beam therapies 
for cancers that are difficult to treat by X-ray, 
including pancreatic cancer’ and breast 
cancer’. By offering this service in a resort- 
type setting, it aims to offer compassionate 
and uplifting medical care while continuously 
improving quality and safety. 

Radiation therapy with X-rays is a valuable 
treatment option for many cancers but 
its role is limited by the risk of damage to 
organs adjacent to the tumour site that can 
sometimes be life threatening. This is because 
X-rays are highly penetrating, imparting 
ionizing energy to cells as they pass through 
the skin and tissues. This is not always a 
limiting factor: in the case of prostate cancer, 
for example, treatment options include radical 
prostatectomy, standard radiation therapy and 
proton therapy. 

By contrast, X-ray therapy is inappropriate 
for early-stage breast cancer because even 
targeted treatment will affect important 
organs adjacent to the tumour but outside 
the targeted area, such as the heart and 
lungs. Early breast cancer has therefore 
conventionally been treated locally by surgery. 

Proton beam therapy provides a means of 


delivering radiation to minimize its impact 
outside the bounds of the tumour. Unlike 
X-rays, protons release most of their energy 
only when they stop — a phenomenon known 
as the Bragg peak effect. The behaviour and 
direction of a proton beam can be precisely 
controlled so that protons stop at the tumour 
site, minimizing the exposure of adjacent 
tissues to ionizing energy. At MPTRC, the 
treatment rooms have rotating gantries that 
allow the proton beam to be applied from any 
direction (Figure 2). It is therefore possible 
to treat the tumour site without affecting 
adjacent critical areas like the lungs and heart. 
Research on proton beam therapy to treat 
small malignant tumours of the breast began 
at MPTRC in 2011. The biggest challenge 
was to immobilize the breast tissue so that 
the tumour could be targeted accurately. 
To date, most treatment facilities have used 
a specialized harness like a brassiere, but 
researchers at MPTRC developed a unique 
stabilization system called the Medipolis 
Technique (MPT) which maintains the breast 
in a fixed position. MPT was first used in the 
treatment of breast cancer in June 2015. 


Proton beam therapy for 
early-stage breast cancer 
Mastectomy and axillary dissection were 
once the primary procedures to treat early 
breast cancer. Breast-conserving therapy 
(BCT) is now widely performed and in 2010 
approximately 60% of cases of breast cancer 
were treated with BCT in Japan’. The rates of 
disease-free survival and overall survival for 
BCT with partial mastectomy are similar to 
those achieved with whole-breast irradiation 
or mastectomy’. Limited surgery, avoiding 
axillary lymph node dissection, is now more 
frequently offered to patients with confirmed 
negative sentinel lymph node biopsy (in 
whom tumour cells are not detected in the 
lymph nodes closest to the breast). 
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Figure 1 | The Medipolis Proton Therapy and Research Center (MPTRC). The center is located in the resort area of Kagoshima Prefecture. The white 


building (yellow circle) is MPTRC, and the brown building is the resort hotel. 


After partial mastectomy, 67-86% of breast 
cancer recurrence occurs in the peritumoural 
bed region. Furthermore, ten-year survival 
and distant metastasis rates are higher when 
recurrence is located within 3 cm of the 
tumour bed and the secondary tumour is 
histopathologically identical to the primary 
tumour’. This suggests that partial breast 
irradiation (PBI) delivered to the peritumoural 
bed region may be sufficient to reduce post- 
partial mastectomy recurrence. PBI is delivered 
by brachytherapy, external beam irradiation 
and intraoperative radiotherapy. 

PBI was evaluated in the RTOG 0319 trial, 
which reported a five-year local recurrence 
rate of 2.7%°. This compares favourably with 
outcomes after whole breast irradiation, 
for which the five-year local recurrence is 
approximately 3-5%. RTOG 0319 also showed 
that PBI was well tolerated, with most adverse 
effects being mild to moderate (grade 1, 42%; 
grade 2, 21%), and a low incidence of severe 
adverse effects (grade 3, 2%; grade 4, 0%)>. 

The MPTRC is now conducting a Phase I/II 
clinical trial of the treatment of early breast 
cancer with PBI using proton beam. This study 
is supported by a research grant from the 
Japanese government's Ministry of Education, 
Culture, Sports, Science and Technology. 
Phase | will determine the optimum dose of 
radiation; Phase II will evaluate safety and 


efficacy. The primary endpoint is safety (the 
frequency and severity of adverse events); 
based on the experience of RTOG 0319”, safety 
will be considered acceptable if the incidence 
of grade 3 or higher adverse events does 

not exceed 10%. Efficacy will be evaluated 

by secondary endpoints including time to 
recurrence, localization of recurrence, overall 
survival and cosmetic outcomes. 


Medipolis Technique: 
immobilization for breast cancer 
radiotherapy 

It is not always necessary to aim for great 
precision when the whole breast is irradiated 
after breast-conserving surgery but targeted 
proton beam therapy requires immobilization 
of the breast and this presents practical 
difficulties for clinicians. Further, there is no 
consensus on the best position of the patient at 
fixation because the shape of the breasts may 
change due to the effect of gravity. Irradiation 
of the breast with the patient in the supine (that 
is, face-up) position would lead to a greater 
damage to the lungs and heart compared 

with the prone (that is, face-down) position. 
However, it is difficult to obtain high geometric 
accuracy during irradiation in the prone 
position because the breast is not secured and 
it shifts with the movement of the thorax due 
to respiration. The Medipolis Technique utilizes 
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a hybrid breast-immobilization system (HyBIS) 
that exploits the best aspects of the supine and 
prone positions. 


Using HyBIS: overview 

There are three phases to HyBIS: preparation, 
setup and treatment. In the preparation phase, 
the patient is secured (except for the affected 
breast) on a whole body immobilization system 
that allows for adjustments in position. The 
device incorporates a photo scanning system 
that generates a digital image of the breast 
from which a unique breast cup is created. 

The breast cup is fitted to the breast and held 
in place with a specially designed retention 
apparatus. 

With the patient and breast immobilized, 
the setup phase of simulation and planning 
treatment based on CT images can begin. 
These data provide the basis on which the 
treatment phase is implemented. Treatment 
ends with the removal of the breast cup and 
relaxation in a bathtub. 


Novel technologies 

MPTRC has incorporated novel technologies 
in the development of HyBIS. The whole body 
immobilization system was designed to hold 
the entire body of a patient in a polycarbonate 
case on a purpose-built aluminum stretcher, 
with vacuum cushions and belts (Figure 3A). 
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The case can float and move vertically and 
horizontally by using compressed carbon 
dioxide gas (Figure 3B). The reason why we 
choose carbon dioxide gas is that a compressed 
gas cylinder of carbon dioxide gas can safely be 
brought into the magnetic resonance imaging 
room. It also includes a breath-synchronizing 
system, a localizer and a converting device to 
adjust the patient's position. 

A converting device was developed to 
change or maintain the position of the patient 
while held in the whole body immobilization 
system (Figure 3C). The device, which can 
accommodate patients up to 1.75 min height 
and up to 80kg in weight, incorporates 
pressure sensors to prevent falls anda 
two-speed motor. The rotating angle can be 
adjusted within <0.1 degree. 

The scanning system comprises a movable 
array of ten cameras controlled by a touch 
screen terminal and powered by a large- 
capacity storage battery. By using battery, it is 
easy to handle the scanning system. This device 
takes a series of 10 images of the breast while 
the patient holds her breath. This procedure is 
repeated twice at slightly different locations to 
obtain images from 30 different viewpoints. 

These 30 images generate a three- 
dimensional (3D) formula of the breast, from 
which a 3D printer creates a breast cup unique 
to each patient. The cup is made of resin lined 
with an adhesive, with a hollow center for the 
nipple (Figure 3D and 3E). It is connected to a 
fixation device with a rod (Figure 3F). The lining 
is highly adhesive and cannot be removed 
until the patient soaks herself in a bathtub. 

As long as the cup is dismantled and washed 
thoroughly, it can be used repeatedly without 
losing its adhesive properties. 

The breast cup is fitted using a specially- 
designed apparatus incorporating a level 
gauge, a pressure sensor, three small video 
cameras and a localizer (Figure 3D). The 
apparatus can be moved and easily locked 
in place on the breast. Positional accuracy is 
ensured by a laser beam focused on a specific 
point of the breast and adjusted during breath- 
holding using the cameras (Figure 3E). The 
breast cup adheres to the skin with a pushing 
pressure and is then uncoupled from the 
apparatus. 

The breast cup is firmly held in a device 
made of hard resin (Figure 3F) that enables 
adjustment in three axes (horizontal, 
longitudinal, and vertical). It is also directly 
linked to the whole body immobilization 
system to minimize the influence of the 
thoracic movement while breathing. 
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Figure 2 | Treatment room (A) and rotating gantry (B). The rotating gantry is a device for focusing the 
proton beam. The 175-ton device rotates 360 degrees around the patient, and can deliver the proton 


beam from any direction. 


The devices used in HyBIS were tested ona 
specially-developed deformable dummy. The 
dummy was designed to simulate a 60 year-old 
woman and it accurately modelled the shape 
and quality of the breast. The dummy body was 
complete, from the head to the femur and was 
covered with thin elastic film. In addition, the 
dummy thorax expands and contracts under 
remote control to simulate respiration. 


Treatment of locally advanced 
pancreatic cancer 

Proton beam therapy can be applied to other 
types of cancer where radiotherapy must 


be highly targeted and its application in the 
treatment of pancreatic cancer, combined 
chemotherapy with gemcitabine, was 
developed at the Hyogo lon Beam Medical 
Center in 2008'. Refinements such as rotating 
gantries and the technique of synchronizing 
irradiation with respiration (the respiratory 
gating technique), developed in the treatment 
of breast cancer, have led to the emergence of 
proton beam therapy as an important option 
for treating locally advanced pancreatic 
cancer. However, this involves a complex 
treatment plan and requires an experienced 
team. 
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Figure 3 | Devices to immobilize a breast. (A) The dummy is immobilized in the whole body immobilization system except for the right breast and the 


head. (B) The immobilized dummy is moved into the position converting device. (C) The position converting device is inclined at 155 degrees and the photo 


scanning system is set up under the gantry. (D) A breast cup is set on the fitting apparatus and a laser fixes position on the nipple. (E) The breast cup is fitted 


and fixed to the breast with a holding device. (F, G) An image of irradiating proton beams for breast cancer. 


Effective radiotherapy of locally advanced 
pancreatic cancer can improve the patient's 
prognosis and reduce pain. This procedure, 
however, is challenging because the pancreas 
is surrounded by the stomach, duodenum 
and jejunum and, posteriorly, the spinal cord 
and aorta. The key to success is the ability 
to deliver a radical dose of radiation to the 
pancreatic tumour while limiting the exposure 
of the surrounding digestive tract. 

X-ray radiotherapy for pancreatic cancer 
is associated with a high risk of adverse 
effects, even with mitigating strategies 
such as stereotactic body radiation therapy 
or intensity-modulated radiation therapy. 
This leads to an interruption of therapy, or a 
dose reduction or even discontinuation of 
treatment. Conversely, the Bragg peak effect 
of a proton beam means that, when therapy 
is delivered from the patient's back, the 
digestive tract is spared. 

Hyogo lon Beam Medical Center uses the 
field-within-a-field technique in the treatment 
of pancreatic cancer. This entails establishing 
two target areas for irradiation: a large area 
involving the adjacent digestive tract anda 
small area solely of the tumour. In this way, 
proton beam therapy can be targeted so 
that the tumour receives a higher dose of 
radiation than the surrounding normal tissues, 
increasing the effectiveness and reducing 


the toxicity of treatment. After completion of 
proton beam therapy, patients then receive 
adjuvant chemotherapy. 

Dr Yoshio Hishikawa began treating locally 
advanced pancreatic cancer at the Hyogo 
lon Beam Medical Center’ and he has further 
developed his work at MPTRC to optimize 
the dose of radiation to individual patients. 
High-level team performance by physicians 
specializing in radiation therapy, medical 
physicists, and radiation technologists, 
supported by nurses, is required to deliver this 
service effectively. As of August 2015, more 
than 140 patients with pancreatic cancer have 
been successfully treated at MPTRC. 


Reaching out 

MPTRC has now treated more than 

1,500 cancer patients, most of whom are 
Japanese. Recently, however, the Center 

has begun to admit more people from 
overseas, including China and Russia. In 
September 2013, MPTRC was the first proton 
therapy facility in Japan to be accredited 

by Joint Commission International (www. 
jointcommissioninternational.org). This 
recognition of the quality and safety of the 
services at MPTRC supports the center’s aim of 
offering patient-friendly proton beam therapy 
and traditional Japanese hospitality, or 
Omotenashi, to people with cancer worldwide. 
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he breadth and complexity of brain and other nervous-system 

disorders make them some of the most difficult conditions to 

diagnose and treat, especially in the developing world where 
there may only be one psychiatrist or neurologist in an entire country. 
‘These disorders occur throughout the lifespan — from infants starved 
of oxygen during difficult births, to children whose cognitive ability 
is stunted owing to malnutrition or exposure to infections or toxins, 
and to adults who develop depression, movement disorders or demen- 
tia. Mental health and behavioural issues are the cause of the largest 
burden of disability, and account for a staggering 184 million disabili- 
ty-adjusted life years (DALYs)!, according to the US Institute of Health 
Metrics and Evaluation. 

That is why the Fogarty International Center (Fogarty) has been 
working with its National Institutes of Health (NIH) partners for more 
than 10 years through the Brain Disorders in the Developing World: 
Research Across the Lifespan Program to catalyse this field of research 
and develop badly needed expertise in low- and middle-income coun- 
tries (LMICs). To mark this milestone, Fogarty felt it was important to 
review progress and consider how best to move forward. 

In its first decade, the programme awarded about US$84 million 
in more than 150 grants. Programme investigators have generated dis- 
coveries detailed in 435 peer-reviewed articles and 14 books or book 
chapters’. Scientists developed clinical assessment tools designed for 
low-resource settings, produced and tested novel interventions, and 
identified promising new approaches. The programme model has en- 
abled investigators in the United States and other high-income coun- 
tries to gain experience working in LMIC settings, while strengthening 
the research base of both US and LMIC institutions through collabo- 
rations. To help sustain this 
significant momentum, pro- 
gramme funding has support- 
ed long-term training of at 


Given the scientific 


strides that have been least 138 LMIC scientists. Data 
e generated by programme par- 

made by working ticipants have provided crucial 
together, it is hoped evidence that has been used 
° to inform international and 

that this research national practice and policy. 
and training agenda Examples include identifying 
and helping to remove a glob- 

can move f orward al barrier to the availability 
in new and exciting of an anti-epileptic drug in 
directions Africa, increasing awareness 


of fetal alcohol syndrome on 

a national level in Russia, 

and convincing the Peruvian 
government to institute acyclovir treatment for herpes simplex virus 
encephalitis. Finally, the initiative has built partnerships for research 
between US and foreign academic institutions, and strengthened the 
long-term research capacity of the LMIC institutions. It has extended 
the frontiers of neuroscience research to include many LMIC institu- 
tions and expanded the research workforce to more rapidly address 
some of the world’s most pressing health problems. 
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The global brain programme would not have been possible with- 
out the full partnership of the other institutes at the NIH who joined 
forces to push forward the frontiers of brain research to include insti- 
tutions and scientists in the developing world — the Eunice Kennedy 
Shriver National Institute of Child Health and Human Development, 
the National Eye Institute, the National Institute on Aging, the Nation- 
al Institute on Alcohol Abuse and Alcoholism, the National Institute 
on Deafness and Other Communication Disorders, the National Insti- 
tute on Drug Abuse, the National Institute of Environmental Health 
Sciences, the National Institute of Mental Health, the National Insti- 
tute of Neurological Disorders and Stroke, and the Office of Dietary 
Supplements. 

In February 2014 funding partners, grantees and brain-disorder 
experts convened for several days of robust discussions to explore 
knowledge gaps and research opportunities. These conversations, or- 
ganized by Fogarty’s Center for Global Health Studies, were the genesis 
for this series of Reviews, which it is hoped will catalyse the interna- 
tional community to devote attention and resources to this crucial re- 
search agenda. Fogarty owes a debt of gratitude to all the participants, 
authors and the editors Donald Silberberg and Rajesh Kalaria. 

Given the scientific strides that have been made in imaging, di- 
agnostics, nanoscience, novel surgical interventions and genetics, by 
working together, it is hoped that this research and training agenda 
can move forward in new and exciting directions. Although the pro- 
gress made is encouraging and mental health, substance misuse and 
chemical exposures are included in the United Nations proposal for 
sustainable development goals’, much more needs to be done. 

It is hoped that this supplement will inspire other scientists and 
funding partners to join us in addressing the full spectrum of research, 
training, implementation and policy questions needed to alleviate the 
suffering caused by global brain disorders in LMICs and other countries 
around the world. 
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Brain and other nervous system disorders across 
the lifespan — global challenges and opportunities 


Donald Silberberg', Nalini P. Anand’, Kathleen Michels* & Raj N. Kalaria* 


This is an exciting time for scientific discovery that aims to reduce the frequency and impact of neurological, mental health 
and substance-use disorders. As it became increasingly clear that low- and middle-income countries have a disproportionate 
share of these disorders, and that many of the problems are best addressed by indigenous researchers who can seek con- 
text-sensitive solutions, the US National Institutes of Health and other research funders began to invest more in low- and 
middle-income country-focused research and research capacity-building to confront this significant public health challenge. 
In an effort to identify existing information, knowledge gaps, and emerging research and research capacity-building oppor- 
tunities that are particularly relevant to low- and middle-income countries, in February 2014 the Center for Global Health 
Studies at the National Institutes of Health Fogarty International Center held a workshop to explore these issues with scientif- 
ic experts from low- and middle-income countries and the United States. This evolved into the preparation of the Reviews in 
this supplement, which is designed to highlight opportunities and challenges associated with topical areas in brain-disorders 
research over the coming decade. This Introduction highlights some of the over-arching and intersecting priorities for ad- 
dressing causes, prevention, treatment and rehabilitation as well as best practices to promote overall nervous system health. 
We review some brain disorders in low- and middle-income countries, while the Reviews describe relevant issues and the 


epidemiology of particular conditions in greater depth. 
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he proportion of the global burden of disease (GBD) that is attrib- 

utable to neurological, mental health, developmental and sub- 

stance-use (NMDS) disorders is expected to rise worldwide, partly 
because of the projected increase in the number of individuals reaching 
the age at which they are at risk of onset of many of these disorders!. This 
rise will be steeper in low- and middle-income countries (LMICs) given 
the long-term effects of early life trauma, infectious disease and malnu- 
trition, which contribute to the development of these disorders that in 
turn lead to early death or a lifetime of disability. Despite their signifi- 
cant contribution to the burden of disease and disability, NMDS disor- 
ders have been largely absent from the global health research agenda, 
and LMICs have insufficient capacity to address them. The past two dec- 
ades have witnessed increased attention to, and investments in, NMDS 
issues. However, given the rising burden of NMDS disorders worldwide 
and opportunities to build on scientific advances while strengthening 
LMIC research capacity, we are at a crucial juncture in moving this agen- 
da forward. This is a time to reflect on and use what we have learned to 
confront existing, and prepare for future, challenges. We hope that the 
research and research capacity-building priorities discussed in this col- 
lection will help to galvanize action to confront the rising tide of brain 
and other nervous system disorders. 


INTERNATIONAL HEALTH COMMUNITY RECOGNITION 
Before the publication of the World Bank’s 1993 seminal Annual 


Report, Investing in Health, neurological, psychiatric, developmental 
and substance-use disorders in LMICs were considered unusual, dif- 
ficult to understand and a low priority for research investment. The 
1993 report, based largely on the first studies of the global burden 
of disease’, has since been refined and progressively updated. Prior 
to the implementation of measures such as the disability- adjusted 
life year (DALY; Table 1), the global burden of disease was primar- 
ily quantified in terms of mortality. With the advent of the DALY, 
the importance of neurological and psychiatric disorders became 
evident, accounting for approximately 28% of the global burden of 
disease. Subsequently, the US Institute of Medicine called for the 
need to direct research resources towards addressing NMDS disor- 
ders in LMICs* in its 2001 study and report, Neurological, Psychiatric 
and Developmental Disorders — Meeting the Challenge in the Developing 
World. This report not only encapsulated a growing body of evidence 
regarding the impact of brain disorders, but it also provided the 
seed for a new initiative to support research and research training 
in global brain disorders: the Brain Disorders in the Developing World: 
Research Across the Lifespan Program (brain programme) support- 
ed by the National Institutes of Health (NIH) Fogarty International 
Center, several NIH Institute and Center partners, and other organ- 
izations (Box 1). Further impetus was added by inclusion of several 
chapters on NMDS disorders in the 2006 second edition of Disease 
Control Priorities in Developing Countries?. 


‘Department of Neurology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA. *Division of International Science Policy, Planning and Evaluation and 
Center for Global, Health Studies, Fogarty International Center, National Institutes of Health, Bethesda, Maryland 20892, USA. *Division of International Research and 
Training, Fogarty International Center, National Institutes of Health, Bethesda, Maryland 20892, USA. ‘Institute of Neuroscience, Newcastle University, Newcastle- 
upon-Tyne NE4 5PL, UK. Correspondence should be addressed to D. S. e-mail: silberbe@mail.med.upenn.edu or R. N. K. e-mail: r.n.kalaria@ncl.ac.uk. 
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Table 1 | Proportions of age-standardized estimated disability adjusted life years (DALYs; >0.5%) attributable to all neurological, mental health and substance-use 
disorders by high-income and low- and middle-income countries (LMICs) 


Disorder Absolute DALYs* as global per thousand Percentage in high-income countries 
people (rank) T (developed) 


Percentage in low- and middle-income 
countries (developing) 


HIV/AIDS$ 130,900 1.19 3.67 
All neglected tropical diseases 108,700 0.072 5.16 
and malariat 

Nutritional deficiencies§ 85,300 (3) 0.63 3.95 
Low back pain 83,100 (1) 5.8 2.78 
Neonatal encephalopathy (birth 50,150 0.33 2.33 
asphyxia/trauma) 

Sensory organ diseases|| 34,700 (5) 1.68 35) 
Neck pain 33,640 (4) 2.24 1.14 
Meningitis$ 29,400 0.13 1.38 
Brain and nervous system cancers | 6,100 0.53 0.19 
Down’s syndrome 705 0.8 aly 


Cerebrovascular disease (all 102,200 5.97 3.79 
strokes)# 

Neurological disorders (all) 73,800 442 2.71 
Migraine 22,360 (6) AL ail 0.84 
Epilepsy 17,400 (23) 0.44 0.75 
Alzheimer’s disease and other 11,350 (21) ays 0.22 
types of dementia 

Other diagnosed neurological 17,870 0.53 0.76 


disorders 


Mental health (all) ** 185,200 (29) Aik al 6.73 
Unipolar depressive disorders 74,260 4.01 2.81 
Major depressive disorder 63,200 (2) Biv 2.39 
Anxiety disorders 26,830 (9) 1.6 0.98 
Schizophrenia 15,000 (11) 0.85 0.49 
Bipolar depressive disorders 12,870 (17) 0.63 0.5 

Dysthymia 11,100 (16) 0.6 0.42 


Drug-use disorders 20,000 (19) toss) 0.67 
Alcohol-use disorders 17,640 (22) 1.52 0.56 
Total rate estimates (major 716,905 36.15 29.1 


categories with direct nervous- 
system involvement and >0.5% 
DALYs only in either a developed 
or developing region) 


*DALY is defined as a measure of overall disease burden and expressed as the sum of years of potential life lost due to ill-health, disability or premature mortality. DALYs", listed high to low for each 
neurological, mental health and substance-use disorder type, for only those disorders with >0.5% rates in either high-income country or LMIC. Estimates for high-income counties were derived from 


the ‘developed’ region data whereas 
was foremost. +Includes cerebral mal 


those for LMIC were from ‘developing’ region data’. tRanked as disorders among the top 25 causes of global years lived with disability (YLDs)"*. Conduct disorder 
aria, encephalitis and HIV dementia. §Includes hearing and vision loss, including macular degeneration. ||lron-deficiency anaemia is key. Non-communicable neu- 


rological disorders include types of dementia, Parkinson's disease and related disorders, epilepsy, multiple sclerosis, migraine, tension-type headache and other neurological disorders. #Includes both 
ischaemic (70-80%) and haemorrhagic strokes (20-30%). **Mental health includes major depressive disorder, dysthymia, unipolar and bipolar depressive disorders, schizophrenia, anxiety disorders, 
eating disorders, autism, Asperger's syndrome, attention deficit hyperactivity disorder, conduct disorder, idiopathic intellectual disability and other mental and behavioural disorders. ttSubstance-use 


disorders include alcohol, opioid, cocaine, amphetamines, cannabis and other drug use. 


Subsequently, The Grand Challenges Canada — Saving Brains and 
Global Mental Health, The EU Tropical Diseases Research Program, 
the UK Wellcome Trust, the UK Medical Research Council and others 
have joined to support global research and training to address brain 
disorders in LMICs (Fig. 1). Despite the successes and investments 
from these efforts and the willing support from numerous partners, 
including the US Agency for Overseas Development and the Bill and 
Melinda Gates Foundation, the implications of the burden of NMDS 
disorders in LMICs remains insufficiently addressed by appropriate 
policies and research and capacity-building investments. As we 
analyse these important ventures (Fig. 1), there are clearly several 
geographical gaps in investment, concerns for the extent of overlap 
in research funding for certain countries, and the potential for part- 
nerships between funders and between countries. 
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BURDEN AND RANGE OF NERVOUS-SYSTEM DISORDERS 
With the advent of the DALY, the importance of neurological and psy- 
chiatric disorders was undeniable; by 2010 all causes of NMDS disorders, 
including stroke, were estimated to account for more than 29° of the 
GBD (Table 1). In tandem with increased life expectancy®, the GBD has 
shifted from premature death to increased years lived in disability per 
100,000. Not surprisingly, as the GBD has continued to move in the di- 
rection of non-communicable diseases, the burden of mental health and 
substance-use disorders has increased by around 40% in the past two 
decades’. Non-communicable diseases have been found to be the major 
cause of death and disability (Table 1), it is therefore time to include them 
as crucial priorities for research and policy initiatives in LMICs. 
Although our appreciation of the importance of NMDs has matured 
significantly, serious contextual problems that obfuscate the true burden 
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BOX 1| THE GLOBAL BRAIN PROGRAMME 


Scope of, and data from, 2003 to 2013 National Institutes of Health/ 
Fogarty International Center ongoing Global Brain programme". 


e Support the development and conduct of innovative, collaborative 
research and research training projects, between developed and 
developing country scientists, on brain disorders throughout life, 
relevant to LMICs" 

e The research should include a lifespan approach, for example 
nutrition or early exposure with sequelae or impact throughout life 

e The programme has created a global network of researchers in 45 
countries 

e Projects have informed policies and programmes at the national 
and international levels 

e Grants have resulted in 435 peer-reviewed publications from 249 
unique journals 

e The programme has catalysed new research projects supported by 
other funders 

e Outputs include new tools for clinical assessment in the LMIC 
context, laboratory tools and methods 


of these disorders remain. By definition, disorders in the GBD classifi- 
cation are attributed to underlying causes rather than to clinical mani- 
festations. Thus, for example, disabilities that result from neuropathies 
are attributed to diabetes or HIV. Similarly, among types of epilepsy, only 
idiopathic epilepsies are considered to be neurological burdens. Where an 
underlying cause for seizures is known or suspected (for example, trau- 
matic brain injury), the GBD methodology attributes the epilepsy-asso- 
ciated disability to the underlying cause. Highest-ranked causes of death 
in the GBD 2010 study include several that are neurological in nature, for 
example stroke (ranked 2nd), malaria (ranked 11th), neonatal enceph- 
alopathy (ranked 24th) and meningitis (ranked 29th). All these condi- 
tions, none of which are included in the neurological category, rank sim- 
ilarly high in terms of their contributions to global disability®. 

Neglected tropical disorders (NTD) are also disproportionately neu- 
rological in nature’. Three of the seventeen NTDs recognized by the World 
Health Organization (WHO) are primarily neurological infections (ra- 
bies, human African trypanosomiasis and leprosy) and, of the remain- 
ing NTDs, some of the more severe manifestations are the result of nerv- 
ous-system involvement (central nervous system schistosomiasis and 
Chagas-related stroke). When these are considered in this context, the 
true estimate of DALYs that are accounted for by NMDS disorders would 
be substantially higher than 29° (Table 1). 

Research undertaken since the US Decade of the Brain campaign in 
the 1990s has established that many of the most common and disabling 
neurological conditions are preventable or remedial with inexpensive 
therapies. Treatment for epilepsy and secondary stroke prevention are 
ranked by the World Bank among the ‘best buys’ in global health’. In 
the past decade, relatively small investments in scientific inquiry that 
are relevant to brain disorders in LMICs are yielding crucial insights, 
which are applicable to the broader global community. These findings 
also point the way to the intervention studies that should follow, which 
range from definitive clinical trials to population-level interventions 
targeting risk factors for NMDS disorders. Where research has pointed 
toward more optimal ways to structure medical education or provide 
health services, implementation with rigorous evaluations is needed 
and plans for a broad scale-up delineated. 

Given this unique point in time, in February 2014 the Center for 
Global Health Studies at the Fogarty International Center held a work- 
shop to explore the state of the science and to identify emerging research 
and research capacity-building priorities in brain disorders that are 
particularly relevant to LMICs. This evolved into the preparation of the 
Reviews in this collection, which is designed to highlight opportunities 
and challenges associated with specific topical areas in brain-disorders 
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IB Fogarty NIH projects ) # 
1B GC Canada projects ; 
160 Fogarty plus GC Canada projects 
1B Fogarty and WT projects 

IB Fogarty , GC Canada and WT projects 


Figure 1 | Research support for neurological, mental health and substance-use 
(NMDS) disorders in low- and middle-income countries. Colour-coded distribu- 
tion of concerted programmes from principal agencies that support (currently 
and during the past decade) projects on NMDS disorders. Further details can be 
obtained from the agency reports!"*. There are numerous other organizations 
that support programmes that affect brain health, but that are not shown on 
this map. Fogarty, Fogarty International Center; GC Canada, Grand Challenges 
Canada — Saving Brains and Global Mental Health; WT, UK Wellcome Trust. 


research in the coming decade, including causes, prevention, treatment 
and rehabilitation. We highlight some of the over-arching and intersect- 
ing priorities for addressing brain disorders in LMICs, and the subsequent 
Reviews describe these issues and the epidemiology of particular condi- 
tions in greater depth. 


THEMES AND INTERSECTING RESEARCH APPROACHES 
Despite financial and human-resource challenges, there have been 
many exciting discoveries related to NMDS disorders that have glob- 
al implications. Examples include refinement of the classification of 
HIV/AIDS; realization of more substance-use disorders; definition 
of the sequelae of infectious diseases, seizure disorders and behav- 
ioural disorders; and identification of mechanisms of environmental 
toxicants in sub-Saharan Africa. Similarly, genetic studies associated 
with neurodegenerative disorders, for example Huntington’s disease 
and Alzheimer’s disease in Venezuela and Columbia have advanced 
our knowledge of diagnosis, risk factors and prospective treatments. 
Genetic and cognitive studies in Brazil are leading to interventions 
that may help to mitigate cognitive deficits in children owing to mal- 
nutrition. 

The unique genetic makeup, environmental and traumatic expo- 
sures, infections, the local health system and nutritional challenges 
of diverse LMIC settings, mean that interventions and approaches 
that have been developed and tested in high-income country set- 
tings will often not be feasible or effective if simply transplanted to 
LMICs. In fact, solutions developed in LMICs may have global appli- 
cations, such as new surgical techniques for children with incipient 
hydrocephalus developed in Uganda (see page S155) and may lead 
to a decrease or an elimination of disablity caused by this condi- 
tion, and perhaps cheaper and better treatments for children around 
the world. Thus, it is crucial that research be conducted in the en- 
vironments in which the techniques are intended to be used. Sev- 
eral Reviews in this series provide examples of why this is the case 
in the context of brain disorders (for example, disorders associat - 
ed with development, trauma, environment and adolescence). The 
conduct of high-quality research requires a cadre of in-country and 
well-trained scientists, and an adequately resourced research infra- 
structure that can support their research (see page S207). This series 
highlights priorities for research and research training that are re- 
lated to specific diseases and conditions. We highlight overarching 
priorities included in these papers that cut across multiple disease 
areas (Box 2). 
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BOX 2 | PRIORITIES FOR DIFFERENT THEMES ACROSS MULTIPLE DISORDERS 


Surveillance and epidemiology 

e Determination of prevalence and incidence of specific brain 
disorders, including in vulnerable populations 

e Strengthening of reporting infrastructure, data quality and disease 
registries, and standardization of registries for country comparisons 

e Use of statistical modelling where possible to inform resource 
allocation and implementation of interventions 

e Prevalence of specific risk factors and measures of their impact 
among affected populations 

e Better design of disease epidemiology that accurately captures 
incidence, type and duration of nervous system sequelae of 
infections and other conditions 


Basic and clinical research 

e Understanding the epigenetic effects on the nervous system 
resulting from malnutrition, infections, environmental exposures and 
psychosocial factors 

e Better understanding of the influence of genetics and the genome on 
brain-disorder pathogenesis and progression 

e Explore the relationship between an individual's and a population’s 
microbiome and correlates with the presence of diverse disorders 

e Understanding of pathways that lead to late onset of neurological 
disease owing to early exposures 

e Advance understanding of gene-environment-brain interactions 

e Assessment of risks and interactions of co-infections and co-morbidities 

e Development of effective and feasible physical, occupational and 
cognitive rehabilitation interventions 

¢ Creation of biobanking, including of brain tissues in LMICs. 


WHAT NEXT? 

Resource constraints and the unique factors that people living in LMICs 
face impose a particularly onerous burden on these countries, where 
most of those with brain and other nervous-system disorders live. This 
burden significantly affects the ability of children and adolescents to 
thrive and live out their true potential, and the ability of young adults 
to be economically productive and support their families, as well as 
the opportunity for older adults to age in safe and nurturing settings. 
However, we may be at a tipping point for research related to glob- 
al brain disorders. Over the past few decades, exciting basic science 
discoveries have been made, effective interventions have been de- 
veloped and advances in technology have set the stage for a research 
agenda that can lead to unprecedented progress in this field. In addi- 
tion, research capacity strengthening in LMICs and an opportunity for 
comparative studies of the markedly different environments that exist 
between high-income countries and LMICs will yield universally ben- 
eficial knowledge. 

Reduction of disease and disability that are associated with brain 
and other nervous-system disorders over the next decade will demand 
increased engagement from, and collaboration among, the scientific 
community, research funding agencies, national governments, aca- 
demic institutions, multilateral organizations, advocacy organizations 
and health providers to increase both research in LMICs and the in- 
digenous research capacity. We need to build on current knowledge of 
overall neurological health, and improve the lives of those living with 
brain and other nervous system disorders at any stage of life. 
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Implementation science and health systems 

e Engagement of diverse stakeholders (including decision makers 
and programme implementers) to facilitate uptake and scale-up of 
effective interventions 

e Evaluation of task-sharing models to address health-care provider 
shortages and to help create efficiencies in the health-care system 
and facilitate the delivery of effective services 

e Research on how best to scale-up interventions that have proven 
efficacious in smaller, controlled settings 


Technology advancement 

e Development and refinement of effective point-of-care diagnostic 
tools, particularly for genetic-based neurological disorders 

e Development and testing of mobile-technology interventions to 
screen, diagnose and monitor treatment of brain disorders 

e Development and implementation of low-cost tools and techniques 
for the treatment and rehabilitation of conditions arising from brain 
and other nervous-system trauma 


Capacity-building and research-infrastructure needs 

e More clinician—neuroscientist researchers and others who are trained 
to deliver appropriate care for patients with brain disorders and 
participate in conducting research 

e Strengthen capacity in neuroethics 

e Development of enhanced and culturally adaptable cognitive 
assessment and screening tools 

e Increased laboratory capacity 

e Increased access to electrodiagnostics, genomic sequencing and 
neuroimaging technologies 
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Reducing neurodevelopmental disorders and 
disability through research and interventions 


Michael J. Boivin'*, Angelina M. Kakooza?, Benjamin C. Warf*, Leslie L. Davidson*© & Elena L. Grigorenko’ 


We define neurodevelopment as the dynamic inter-relationship between genetic, brain, cognitive, emotional and behavioural 
processes across the developmental lifespan. Significant and persistent disruption to this dynamic process through environmen- 
tal and genetic risk can lead to neurodevelopmental disorders and disability. Research designed to ameliorate neurodevelop- 
mental disorders in low- and middle-income countries, as well as globally, will benefit enormously from the ongoing advances in 
understanding their genetic and epigenetic causes, as modified by environment and culture. We provide examples of advances 
in the prevention and treatment of, and the rehabilitation of those with, neurodevelopment disorders in low- and middle-in- 
come countries, along with opportunities for further strategic research initiatives. Our examples are not the only possibilities for 
strategic research, but they illustrate problems that, when solved, could have a considerable impact in low-resource settings. In 
each instance, research in low- and middle-income countries led to innovations in identification, surveillance and treatment of 
a neurodevelopmental disorder. These innovations have also been integrated with genotypic mapping of neurodevelopmental 
disorders, forming important preventative and rehabilitative interventions with the potential for high impact. These advances 
will ultimately allow us to understand how epigenetic influences shape neurodevelopmental risk and resilience over time and 
across populations. Clearly, the most strategic areas of research opportunity involve cross-disciplinary integration at the inter- 
section between the environment, brain or behaviour neurodevelopment, and genetic and epigenetic science. At these junctions 
a robust integrative cross-disciplinary scientific approach is catalysing the creation of technologies and interventions for old 
problems. Such approaches will enable us to achieve and sustain the United Nations moral and legal mandate for child health and 


full development as a basic global human right. 


Nature 527, S155-S160 (19 November 2015), DOI: 10.1038 /naturel6029 


This article has not been written or reviewed by Nature editors. Nature accepts no responsibility for the accuracy of the information provided. 


ne evaluation of early childhood developmental status in low- 
and middle-income countries (LMICs) estimates that 15.7% 
of children are significantly delayed in their cognitive devel- 
opment, 26.3°% in socioemotional development and 36.8% in either or 
both (D. C. McCoy, personal communication). Stunting, low wealth and 
living in a rural area are significantly associated with neurodevelopmen- 
tal delay; most of the children live in Africa and eastern Asia. Fortunate- 
ly, neurodevelopmental science is benefitting from rapidly expanding 
technologies for the integration of the environmental (for example, 
infectious disease, nutritional and carer quality), brain-related (for ex- 
ample, developmental neuroscience and brain imaging) and genetic (for 
example, epigenetic modelling and genomic big data) domains that drive 
neurodevelopment. Figure 1 illustrates the mutually interactive nature of 
these three developmental domains, along with the current strategic ar- 
eas of research at the environment-brain-gene interface (Box 1). 
Advances in developmental science have triggered a recon- 
ceptualization of neurodevelopment based on the recognition that 


developmental processes are a part of child health in the broader con- 
text of communicable and non-communicable disease'. The develop- 
mental origins of the health and disease hypothesis proposes that the 
physiological processes of developmental plasticity operate in early 
childhood, but have the potential for adverse consequences in later 
life? Consequently, childhood — particularly early childhood — is a 
high-priority target for both preventive and remediating interventions 
to address the pervasive developmental needs in LMICs (D. C. McCoy, 
personal communication). 

In this Review, we describe several high-impact findings that 
have emerged from research in low-resource settings that pertain to 
the developmental milieu of the child, its relationship to the brain and 
behavioural neurodevelopmental integrity of the child (neurodevelop- 
mental disorders), and the genetic and epigenetic underpinnings that 
can drive this relationship. As we review key scientific advances in 
each of these three domains, we propose strategic areas of ongoing and 
future research that could provide innovative models to fuel significant 
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advances and evidence-based interventions for meeting the develop- 
mental needs of children. We conclude by summarizing ways in which 
this model (environment, brain and gene) provides rich opportunities 
for a more global approach to child-development science, making it 
possible to achieve the UNICEF mandate of full child health and devel- 
opment for all*. 


APPROACHES TO MALARIA 

In 2013 there were around 198 million cases of malaria of which 
584,000 resulted in death’. A child dies from malaria every minute, 
and one in four survivors present with significant neurodevelopmental 
impairment*’. However, cognitive rehabilitation, speech and phys- 
ical therapy, and carer-training interventions can improve cognitive 
performance and behaviour of treated mother-child pairs*°. As re- 
habilitation approaches are evaluated, there is mounting evidence of 
the neurocognitive benefits of computerized cognitive rehabilitation 
training (CCRT) in African children with a brain injury as a result of 
severe malaria and in those with HIV-related brain injury!®. Dissem- 
ination and implementation science must now inform innovative ap- 
proaches to bring such interventions to scale in low-resource commu- 
nities. Mobile network health (mHealth) research opportunities are a 
high priority, given the ever-increasing access that children and ad- 
olescents in low-resource settings have to mobile-based internet and 
computing technologies". Another key strategic research opportunity 
is to evaluate the impact of neurocognitive rehabilitation interventions 
such as CCRT, on the enhancement of brain-development neuropro- 
tective factors!" (Box 2). We can then evaluate the extent to which 
such brain-based biomarkers mediate the neuropsychological benefits 
of CCRT, along with how neurocognitive rehabilitative interventions 
diminish biomarkers of brain inflammation (such as tumour necrosis 
factor-a. (TNF-a) and creatinine). 


APPROACHES TO PAEDIATRIC HIV 

Globally, roughly 3.4 million children live with HIV infection and are 
at high risk of significant neurodevelopmental disabilities. Of these, 
almost 90% live in Africa where only 24% of infected children have 
access to anti-retroviral (ARV) treatment'*. These children’s environ- 
mental risk factors are compounded by poor nutrition owing to protein 
and specific micronutrient deficiencies’. They also often have parasit- 
ic, respiratory and enteric diarrheal infections’. Such compounded 
risk exists whether a child is infected with HIV (proximal risk) or lives 
in a household or community where HIV has a persisting and signifi- 
cant disruptive impact (distal risk)!’. Multifaceted risk for all kinds of 
early developmental insults (for example, infection, malnutrition and 
poverty) demands that children in low-resource communities need a 
comprehensive package of assessments and interventions to holistical- 
ly enhance their development”®. 

Recent epigenetic evidence suggests that chronic poverty may 
‘shrink’ children’s brains over successive generations as document- 
ed by longitudinal multigeneration brain-imaging research'*. These 
considerations justify the junction between the environment and 
the brain as a highly strategic point of intervention. This is further 
illustrated by the strategic importance of implementation-science 
research in designing an effective comprehensive package of servic- 
es for antenatal and postnatal care. This is evident when considering 
at-risk adolescent mothers in LMICs and the heightened risk of neu- 
rodisability in their infants. Pregnancy in adolescence is associated 
with premature delivery, stillbirth, fetal distress, birth asphyxia, low 
birth weight and miscarriage”. Furthermore, if the adolescent moth- 
er is also suffering from malnutrition these risks are compounded for 
the infant — long-term effects as a consequence of low birth weight 
include stunting, poor neurodevelopmental outcomes, and increased 
susceptibility to cardiovascular and metabolic diseases such as obesi- 
ty and diabetes”. In fact, there is evidence that environmental factors 
such as nutrition can alter epigenetic modifications and thus play a 
part in the development of these disorders later in life”. The maternal 
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microbiome is also important to infant health outcomes, including 
the risk of pre-term birth, the development of gastrointestinal dis- 
eases such as irritable bowel syndrome, and the development of the 
immune system”. 

One of the greatest public health initiatives developed in the mod- 
ern era of infectious disease is the prevention of mother-to-child 
transmission (PMTCT) of HIV. These interventions have reduced per- 
inatal infection of children born to infected mothers from more than 
30% to less than 1%. However, there are still gestational neurode- 
velopmental risks associated with early exposure to ARVs**. One of 
the most exciting developments in the treatment of HIV has been the 
development and anti-retroviral characterization of VRCO1. This is a 
potent and broadly neutralizing anti-HIV monoclonal antibody that 
prevents HIV-1 transmission from plasmacytoid dendritic cells to CD4 
T lymphocytes”. Once proven safe for infants, such therapies should 
be administered as soon as possible after the diagnosis of HIV in in- 
fants, and the long-term neurodevelopmental and neurocognitive 
protective benefits of such innovative treatment strategies should be 
evaluated. These therapies could also be effective in the prevention of 
HIV transmission. 


TRAUMA-ASSOCIATED PSYCHIATRIC ILLNESS 

Another strategic research opportunity is to further evaluate how ma- 
ternal depression is associated with widespread changes in DNA meth- 
ylation in their offspring”®?’. Such epigenetic processes can result in 
heightened risk of depression and anxiety disorders in children as they 
become adults”*. How best to package and bring to scale a strategic set 
of intervention services that address this remains a neglected area in 
high-impact implementation science. Likewise, populations trauma- 
tized through conflict and genocide can pass on psychiatric disorders 
transgenerationally. This may be partly mediated by the hormonal ef- 
fects of maternal stress on neuropsychiatric risk for children in utero in 
regions where women have been traumatized through sexual violence 
in conflict zones (glucocorticoid~mediated inducement of cytokine in- 
flammatory responses causing methylation of DNA in children in utero). 
Such intergenerational epigenetic mechanisms of psychiatric disorders 
necessitate evidence-based and sustainable community-wide treat- 
ment strategies to address these disorders within the foundational 
mother-child caring fabric of that society”. Task shifting will be a 
crucial strategy in addressing such community mental health support 
services**!. There is evidence to support the effectiveness of a year- 
long maternal carer training programme for children who are affected 
by HIV in rural Uganda to facilitate child development in low-resource 
settings, while remediating maternal depression and enhancing carer 
functionality*". 


NODDING SYNDROME 

The beginning of the millennium was marked by the manifestation of 
the enigmatic condition nodding syndrome, which affects school-age 
children, and is reported in South Sudan, northern Uganda and south- 
ern Tanzania. This condition is characterized by episodes of repeti- 
tive nodding (dropping forward of the head) often coupled with sei- 
zure-like behaviours (for example, convulsions or staring spells) that 
occur during attempted feeding***. Nodding syndrome is also char- 
acterized by stunted brain growth, including significant brain atrophy 
near the hippocampal and glia matter of the brain and significant cer- 
ebellar involvement. This is accompanied by lifelong profound neuro- 
disability, severe behavioural problems and high mortality’. 

The nodding is caused by an atonic seizure, but the aetiology of this 
seizure is unknown, although associations with other developmental 
conditions have been established. Nodding syndrome is most preva- 
lent in areas with high infection rates of the parasitic worm Oncho- 
cerca volvulus — a nematode carried by black fly of the genus Simulium 
— the bites of which can cause onchocerciasis, a highly prevalent type 
of blindness caused by infection. Other reports suggest an associa- 
tion between the syndrome and malnutrition*>. Future research of this 
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syndrome must focus on understanding the aetiology so that it can be 
prevented, diagnosed early and treated effectively. Emerging diseases 
that profoundly affect children, such as nodding syndrome, provide 
an important opportunity for developing diagnostic, management and 
intervention techniques adapted to LMICs that, in turn, can be used for 
the prevention of worldwide outbreaks of diseases that lead to severe 
disability. 

Although nodding disease is highly localized, such enigmatic dis- 
orders that arise from time-to-time and result in profound neurodis- 
ability are important because they reveal the urgent need to develop 
scientific models that can be seamlessly integrated into emerging dis- 
ciplines. These include geographical ecological mapping, maps of par- 
asitology dispersion, genotypic mapping across populations and their 
geographical dispersions, and geographically mapped epidemiological 
risk of infectious disease. These multi-layered models must then be 
integrated with neuropathogenic mechanism models that include sen- 
sitive and specific brain inflammatory markers, as well as the corre- 
sponding neuropsychological sequelae of such central nervous system 
inflammatory markers. 


MALNUTRITION AND DISEASE 

Childhood malnutrition, both through prenatal and perinatal maternal 
micronutrient deficiencies**, infant micronutrient deficiencies*’, and 
protein-calorie deficiency, imposes a heavy burden on neurodevel- 
opment**“°. The primary effects of malnutrition have been associated 
with elevated mortality, morbidity, and risk of cognitive and soci- 
oemotional impairment. Although it has been extensively researched, 
and interventions have been attempted, malnutrition remains a se- 
rious challenge to children’s development in LMICs. Efforts have not 
yet succeeded in eliminating malnutrition or in successfully bringing 
interventions to scale“. Secondary effects of malnutrition are associ- 
ated with vulnerability to microbial pathogens that can also severely 
disrupt neurodevelopment****. 


Enteric infections 

The aetiology of malnutrition is complex. In particular, malnutrition 
might result from enteric infections of bacteria that are highly prev- 
alent in LMICs, and include both well-known (Escherichia coli, Vibrio 
cholerae, and species of Salmonella, Shigella and anaerobic streptococ- 
ci)“ and emerging pathogens (enteroaggregative E. coli, Cryptosporid- 
ium and Giardia)*’. These infections can significantly affect childhood 
brain or behavioural development, presumably through damage to the 
gut microbiota. This can lead to intestinal inflammation that dimin- 
ishes intestinal absorption, and protein and micronutrient deficien- 
cies compounded by recurring dehydration and malaise**”. This field 
of research has also significantly advanced our understanding of the 
inter-relationships between genetics (for example, neuroprotective 
APOE polymorphisms), enteric diseases, nutritional malabsorption 
and neurodevelopment in young children**”. 

An important research opportunity provided by this work involves 
the clinical evaluation of the neurodevelopmental benefits of micronu- 
trient interventions to enteric disease, including whether glutamine 
works better than glucose as a key ingredient of oral rehydration and 
repair therapy (ORRT)®. Glutamate intervention may be more effective 
in the repair of intestinal barrier functions and hence improve child de- 
velopment as well as the absorption of ARV drugs in children with HIV. 


Food-borne neurotoxins and nutritional malabsorption 

Konzo disease is a permanent, irreversible, upper-motor neuron dis- 
order, occurring primarily in rural areas of sub-Saharan Africa that are 
dependent on bitter varieties of cassava (Manihot esculenta; an annual 
crop cultivated for its edible starchy tuberous root, which is a major 
source of carbohydrates and, therefore, a food staple). Epidemiologi- 
cal studies have documented konzo outbreaks — mostly in women and 
children — in periods of food insecurity that have been brought about 
by drought, displacement by war or conflict, or other factors that have 
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BOX 1 | STRATEGIC ONGOING AND IMMINENT 
RESEARCH OPPORTUNITIES 


Strategic ongoing and imminent research opportunities at the 
intersection between brain, gene and environment (Fig. 1), which 
potentially lead to neurodisability interventions in low- and middle- 
income countries. 


Gene and brain — ongoing research opportunities 

e Neural tube defects with associated hydrocephalus and 
developmental brain anomalies. 

e Monogenic disorders (either heritable or de novo) for which 
neurodevelopmental disorders are caused by various types of 
mutations (from a point mutation such as in sickle cell disease to 
in a single gene such as in Rett syndrome). 

e Disorders due to alterations in the mitochondrial genome (for 
example, creatine deficiency syndromes). 

e Neurodevelopmental disability of unknown origin such as autism 
spectrum disorders, specific learning disabilities, attention deficit 
hyperactivity disorder and conduct disorders. 


Environment and brain — ongoing and imminent 

(shown by *) research opportunities 

e Brain injury from central nervous system infections related to 
neonatal sepsis (meningitis, ventriculitis and cerebritis) and 
resulting post-infectious hydrocephalus. 

e Toxic exposure of cyanide to young children fed cassava as a 
result of food insecurity and insufficient processing of cassava 
with high linamarin content. 

e *Evaluate neurocognitive rehabilitation interventions on brain 
development neuroprotective factors and on biomarkers of brain 
inflammation (for example, TNF-c and creatinine). 

e *Hormonal effects of maternal stress on child neuropsychiatric 
risk in utero for mothers in LMICs who have been traumatized 
through sexual violence in conflict zones (glucocorticoid- 
mediated inducer of the cytokine inflammatory responses). 

e *Chronic poverty may ‘shrink’ brains over successive generations 
as documented by magnetic resonance imaging research. 


Environment and gene — imminent research op- 

portunities 

e Maternal depression is associated with widespread changes 
in DNA methylation in their offspring that may persist into 
adulthood for exposed children. 

e Genetic vulnerability to onchocerciasis may lead to 
neuroinflammation, seizures, and profound neurodisability in 
the form of nodding disease in select vulnerable populations in 
regions highly endemic for onchocerciasis. 

e Genetic factors related to why asymptomatic positive malaria 
parasitaemia progresses to cerebral malaria (or severe malaria 
anaemia) in African children with subsequent brain injury. 

e APOE and neuroprotection for enteric diseases (with elevated 
risk for age-associated dementia types). 


led to the insufficient processing of cassava tubers. The insufficient 
breakdown of linamarin compounds that contain cyanide result in 
neurological damage and seem to lead to outbreaks of konzo, which 
has been documented mostly in the Congo, Central African Republic, 
Mozambique and Tanzania*!** with a prevalence of between 0.1% and 
17% in affected villages**. Studies have recently documented neuro- 
cognitive impairments in children with konzo. Furthermore, even 
children who do not show signs of konzo, but who live in konzo-af- 
fected households may have neurocognitive impairment of working 
memory and learning ability”. 

Konzo offers an important opportunity for integrative neurodevel- 
opmental science. Neuroinflammatory markers of brain injury from 
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cyanide toxicity and inflammatory markers of microbiota destruction 
in the gut from cyanide toxicity need to be mapped on sensitive neu- 
rocognitive impairment indicators in children. Konzo offers a rare op- 
portunity to test integrative models of nutritional toxicity in the brain 
and gut against a backdrop of malnutrition and corresponding mi- 
cronutrient deficiencies. Gauging their comparative weighting in the 
mediation of neurodevelopmental disability within the cognitive and 
neuromotor domains will allow us to determine the effectiveness of 
prevention and treatment strategies. Since konzo is entirely prevent- 
able, health education and promotion intervention methods should be 
evaluated at the community-wide level in terms of the benefit to disa- 
bility-adjusted life years®. 


TREATING HYDROCEPHALUS IN LMICS 

Hydrocephalus, the abnormal accumulation of cerebrospinal fluid in 
the cerebral ventricles, has multiple causes, and is especially prevalent 
in LMICs. Failure to treat the condition almost always leads to death or 
severe neurodevelopmental disability. Higher birth rates and limited 
perinatal care contribute to a greater burden of care for hydrocephalus 
in LMICs*” (for example, there are 100,000-250,000 new infant cases 
of hydrocephalus annually in sub-Saharan Africa alone“). In addition 
to the expected burden of congenital hydrocephalus in LMICs, cli- 
mate-driven neonatal ventriculitis of unknown pathogenesis has re- 
cently been identified as one of the chief causes of infant hydrocephalus 
(60°% of cases in Uganda)**-“. In sub-Saharan Africa, rates of neonatal 
sepsis are estimated to be 170 per 1,000 births, with a corresponding 
mortality of 10 deaths per 1,000 births’. For survivors with post-in- 
fectious hydrocephalus (PIH), neurodevelopmental consequences of 
the primary brain injury can be devastating even before hydrocephalus 
develops. One-third of those with PIH remain profoundly disabled at 
five years, even after successful surgery®*. However, innovative surgi- 
cal techniques have been pioneered and developed in Uganda that have 
revolutionized the treatment of hydrocephalus worldwide? ©. 

The standard treatment for hydrocephalus has long been the im- 
plantation of tubing that drains cerebrospinal fluid from the ventri- 
cles to the peritoneal cavity (ventriculoperitoneal shunt). However, 
this treatment creates lifelong dependence on an unreliable implanted 
device that often requires an emergency operation when it fails (40% 
failure within 2 years of the original implantation)®°. An effective, 
minimally invasive treatment method (endoscopic third ventriculo- 
stomy (ETV) combined with endoscopic choroid plexus cauterization 
(CPC)) that avoids shunt-dependence in most infants was developed 
in Uganda as an alternative’. The procedure — the safety and efficacy 
of which were demonstrated initially in LMICs and then in high-in- 
come countries®> — combines two techniques. These involve creating a 
new opening through the floor of the third ventricle and reducing the 
choroid plexus tissue in the lateral ventricles by cauterization. Building 
the capacity to achieve universal access to optimal and affordable hy- 
drocephalus treatment for infants in LMICs is an ongoing challenge”. 
Present efforts involve task shifting by training non-physician medical 
officers to undertake the shunt placement, allowing neurosurgeons to 
focus on the more complex third ventriculostomy procedures. Dissem- 
ination and implementation research is needed to test the effectiveness 
of this task-shifting approach. 


GENETIC STUDIES OF NEURODEVELOPMENT 

The genome has a substantial role in the aetiology of neurodevelop- 
mental disorders. These disorders can be classified into six major cat- 
egories (Box 2). There is abundant evidence that the disorders in all six 
categories may affect many facets of child development. The disorders 
in categories 1-3 are typically severe and impose multiple develop- 
mental challenges from birth. These conditions are referred to as con- 
genital conditions (their broad definition also includes conditions that 
result from various challenges in pregnancy, such as severe micronu- 
trient deficiency, for example folate deficiency). Given what is known 
about the prevalence of these conditions in high-income countries, 
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BOX 2 | SIX GENETIC CAUSES OF NEURODISABILITY 
IN CHILDREN 


1. Disorders caused by specific genomic lesions (either heritable or 

arising de novo). 

2. Monogenic disorders (either heritable or de novo) for which the 
disorders are caused by various types of mutations (from a point 
mutation, such as sickle cell disease, to repeat expansion) in a 
single gene (for example, mutations in the MECP2 gene in Rett 
syndrome and a number of CGG repeats in the FMR1 gene in 
fragile X syndrome). 

. Disorders due to alterations in the mitochondrial genome (for 
example, creatine deficiency syndromes). 

4. Relatively common disorders such as autism spectrum disorders, 
specific learning disabilities, attention-deficit hyperactivity 
disorder and conduct disorders. 

. Disorders triggered by the environment, but the burden of which 
is controlled by the genome. 

6. Conditions that arise from the involvement of the epigenome 

(modifications in the function of the genome that are not caused 
by any structural alterations in the genome itself). 


Ww 


wn 


estimates suggest that at least 7.6 million children are born annually 
with severe congenital conditions, and that the number is especially 
high in LMICs°. 

The disorders in categories 4-6 include common multifactorial 
conditions with onset in early childhood. Of note, less than 50% of 
countries have policies for the control of these conditions®’. These con- 
ditions currently constitute a substantial health challenge in high-in- 
come countries, but are substantially understudied, under diagnosed 
and underserved in LMICs. Although limited, the relevant research in 
LMICs unfolds in a number of dimensions, converging around the un- 
derstanding that economic development and changes in lifestyle have 
led, or are leading to, a rapid increase in the observed prevalence of 
these multifactorial disorders. In other words, as people’s environment 
improves, the role and prominence of genetic and genomic factors will 
increase. Common disorders include conditions that are attributable 
to epigenetic influences* (for example, DNA methylation and histone 
modification). 

In this context, two epigenetic mechanisms have been highlight- 
ed. The first mechanism connects nutritional challenges to the mani- 
festation of metabolic syndromes. This happens through a causal link 
between nutrient restrictions in utero and in early childhood, lack of 
clean water and sanitation, and high levels of infectious organisms in 
the environment. These can lead to epigenetic changes in pathways re- 
lated to metabolism, blood pressure and glucose regulation®. The sec- 
ond mechanism is the link between psychological stress and the gluco- 
corticoid-mediated inducer of the cytokine inflammatory response”’. 
Both exemplify the developmental origins of the health-and-disease 
hypothesis and its relevance to the aetiology of neurodevelopmental 
disability in LMICs”. 

Key research and training priorities related to these six disorder 
categories are: determining their global prevalence; training scientists 
in appropriate molecular technologies and sustaining this increased 
human-resource capacity by providing ongoing support and training 
to keep up with the rapid technological advances in the field; devel- 
oping methods for cheap and reliable diagnoses of the widest possible 
range of congenital conditions and identification of the broadest possi- 
ble range of risk factors for complex multifactorial disorders; develop- 
ing practical, accessible and inexpensive procedures for family-plan- 
ning counselling (preconception and post-delivery); and continuing to 
build the capacity and infrastructure needed to initiate cutting-edge, 
relevant research that is comparable with that taking place in high-in- 
come countries. These research and training priorities should translate 
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Epigenetics 


Figure 1 | Mutually interactive domains — environment, gene and brain — in- 
teract in terms of environment—gene socioevolutionary processes (epigenetic), 
environment-brain moment-by-moment neurocognition (neuropsychology), 
and gene-brain universal brain and behavioural processes in child neurode- 
velopment. The foundation for this multi-level interaction is brain plasticity as 
shaped by risk and resilience in child neurodevelopment, which occurs at the 
evolutionary (physical environment), cultural (social environment), individual 
(brain) and neuronal genotype (genetic) levels. The most strategic points of 
research opportunity as presented in this Review occur at the intersections be- 
tween brain, gene and environment. 


into public health services that can help couples in family planning 
and the resource mobilization needed to nurture children with such 
disorders. 


CONCLUSION 

We have outlined significant scientific findings and challenges that 
have emerged from research in LMICs. We have provided strategic re- 
search examples and areas of research opportunity (aetiology and in- 
tervention) at the junction between the environment, brain and gene. 
The dynamic interactions among these three domains are at the foun- 
dation of brain neurodevelopment in children (Box 1). New technolo- 
gies are providing ever more sensitive biomarkers that can be related to 
the brain and behavioural neurodevelopmental integrity of the child. 
New technologies are also emerging that link the regional and global 
surveillance of neurodisability to environmental risk, and these can 
be integrated with the genetic and epigenetic underpinnings that drive 
this relationship. 

Future approaches must accommodate the use of new data gath- 
ered by innovative technologies, offering fresh approaches to old prob- 
lems in child development in LMICs. These new approaches will prove 
to be especially strategic at the points of interface and integration be- 
tween the environment, gene and brain (Fig. 1). New models that can 
effectively integrate these three domains into a comprehensive and co- 
hesive paradigm must have the following hallmarks. 


Interdisciplinary approaches 

Research in LMICs needs to take into account the complex multifac- 
torial causes of neurodisability (Fig. 1). Environmental risk from nat- 
ural disasters, social unrest, poverty and infection can offset a child’s 
neurodevelopmental trajectory at the points of interface between gene 
and brain, during the antenatal and postnatal stages of development. 
In addition to toxins from the diet such as cassava-based cyanide 
in konzo, environmental toxins from mining and biomass fuels are 
leading to levels of exposure that can affect child neurodevelopment 
across entire communities. These complex systems of developmental 
risk factors call for comprehensive interdisciplinary approaches to 
understand the developmental trajectories so as to identify children 
at serious risk of neurodisability and those in need of intervention 
in LMICs. For example, maternal health programmes need to work 
closely with early childhood programmes to ensure an optimal prena- 
tal environment for the developing foetus, improved pregnancy out- 
comes, and effective parental and community-wide interventions to 
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enhance child development. Another example is the engagement of 
parents in child-awareness programmes to facilitate their cognitive 
and socioemotional development. We have cited examples of specific 
family-based” and community-based” interventions that have been 
successfully used in LMICs. Such interventions involve the integration 
of anthropology, public health education and promotion, social and 
media science, and developmental paediatric research. 


Employment of new technologies 

New technologies are enhancing research approaches in LMICs, pre- 
senting enormous potential to transform health-care delivery. The 
development of new mobile technologies for surveillance, assessment 
and treatment are particularly needed in LMICs, where mobile phone 
ownership is rapidly rising. Computerized interventions are already 
being used for the treatment of children with cerebral malaria and HIV. 
New and improved surgical techniques will be crucial for saving lives 
and altering atypical developmental trajectories. The miniaturization 
of diagnostic technologies that provide data for integrative risk maps, 
which can be integrated at the population level with genome distribu- 
tions, will allow for effective population surveillance and public health 
intervention at a community-wide level. 


Implementation research 

Research in LMICs cannot be divorced from the health systems and 
the cultural context in which the populations are situated. Research on 
the implementation of evidence-based prevention and intervention is 
needed. Scientifically sound interventions scaled up to the commu- 
nity and national level will require working with governmental and 
non-governmental partners to ensure sustainability. As already noted, 
significant advances have been made by ‘task shifting’ in resource-con- 
strained settings in order to, for example, delegate health-care tasks to 
health workers with lower qualifications. Such strategies have shown 
especially promising results in dealing with mental health gaps, but 
can be effectively applied for the rehabilitative care of neurodisability 
in children”. Task-shifting strategies, however, need to be evaluated for 
approaches within dissemination and implementation science. It is only 
in this manner that we will achieve the UN moral and legal mandate of 
full childhood neurodevelopment as a basic human right for all. 
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A focus on adolescence to reduce neurological, 
mental health and substance-use disability 
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Globally, there is a crucial need to prioritize research directed at reducing neurological, mental health and substance-use disor- 
ders in adolescence, which is a pivotal age for the development of self-control and regulation. In adolescence, behaviour optimal- 
ly advances towards adaptive long-term goals and suppresses conflicting maladaptive short-lived urges to balance impulsivity, 
exploration and defiance, while establishing effective societal participation. When self-control fails to develop, violence, injury 
and neurological, mental health and substance-use disorders can result, further challenging the development of self-regulation 
and impeding the transition to a productive adulthood. Adolescent outcomes, positive and negative, arise from both a life-course 
perspective and within a socioecological framework. Little is known about the emergence of self-control and regulation in ado- 
lescents in low- and middle-income countries where enormous environmental threats are more common (for example, poverty, 
war, local conflicts, sex trafficking and slavery, early marriage and/or pregnancy, and the absence of adequate access to educa- 
tion) than in high-income countries and can threaten optimal neurodevelopment. Research must develop or adapt appropriate 
assessments of adolescent ability and disability, social inclusion and exclusion, normative development, and neurological, mental 
health and substance-use disorders. Socioecological challenges in low- and middle-income countries require innovative strate- 
gies to prevent mental health, neurological and substance-use disorders and develop effective interventions for adolescents at 


risk, especially those already living with these disorders and the consequent disability. 
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dolescent neurological, mental health and substance-use 

(NMS) disorders in low- and middle-income countries (LMICs) 

must be addressed to ensure optimal development of ‘human 
capital’ for the future'. Adolescents, defined by the World Health Or- 
ganization (WHO) as those between 10 and 19 years of age, represent 
an estimated 1.2 billion of the world’s population’. During these years, 
cognitive development continues to unfold, socioemotional devel- 
opment advances dramatically, and a constellation of NMS disorders 
reaches a peak during adolescence and early adulthood? (Fig. 1). Re- 
search and practice in LMICs have focused on children under five years 
old, whereas adolescents are rarely prioritized. 

Indeed, more than 15% of disability-adjusted life years (DALYs) are 
attributable to adolescents and young adults (20-24 years old), with 
DALY rates in Africa 2.5 times that of high-income countries®. Global- 
ly, the primary causes of years lost due to disability (YLD) for adoles- 
cents include neuropsychiatric disorders (45%), unintentional injuries 
(12%) and infectious diseases (10°%)°. Up to 20% of young adults have 
a disabling mental illness, and up to 50% of adult mental health disor- 
ders experience their onset in adolescence®. Adolescence has received 
more attention since being made a high priority by the United Nations 
programme UNICEF and other agencies!”, facilitating opportunities for 
prevention of NMS disorders and support for adolescents already living 


with these conditions (Box 1). 

Adolescence is a crucial period of brain development that leads to 
increased self-regulation. However, concomitant impulsiveness can 
lead to risky behaviours that result in impaired cognitive or emotional 
development, lifelong disability and even death. These behaviours and 
challenging environmental exposures often interact over time, com- 
pounding their effects. For example, early initiation of substance misuse 
might lead to impaired cognitive and affective development, later addic- 
tion, brain injury and other health-related disorders such as HIV/AIDS®. 

The impact of NMS disorders on adolescents is best understood 
from a life-course perspective. In this Review, we highlight only the 
causes of NMS disorders that tend to first appear during adolescence. 
We consider risk and resilience through a socioecological model in 
which an adolescent with a particular genome undergoes physical 
and hormonal transformations within the context of a family, peers, 
school, work, community and culture all of which can be instrumen- 
tal in determining adolescent NMS outcomes. Few studies have been 
conducted using parallel methods across high-income countries and 
LMICs, and since it is not clear to what extent findings in high-in- 
come countries can be generalized to LMICs, we suggest that there is 
a need for comparative research using parallel methods. This Review 
focuses on research priorities to reduce the most important threats to 
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Figure 1 | Disability-adjusted life years (DALYs) for each neurological, mental 
health and substance-use (NMS) disorder in 2010 by age’. Note the rise of NMS 
disorders in late childhood and adolescence, particularly depression, anxiety, 
alcohol and other drug-use disorders. Reprinted with permission from ref. 4. 


adolescents in LMICs and contains suggestions to improve methodol- 
ogy and a brief list of high-impact research priorities (Box 2). We also 
suggest a range of broader research opportunities, priorities and chal- 
lenges, including research capacity building (Supplementary Table 1). 


NEURODEVELOPMENT AND NMS DISORDERS 
Adolescence is a time of profound change as young adults cope with 
increasing independence and the growing importance of social and 
sexual relationships, while simultaneously developing and exercising 
self-control. It is unknown whether hormonal, cognitive, affective and 
societal transitions from childhood to adolescence occur in a similar 
time frame and sequence in LMICs as they do in high-income coun- 
tries, or whether any differences in timing or sequencing affect neural 
development. The likelihood that an adolescent will develop a specific 
NMS- related disability depends to some extent on the balance between 
higher order cognitive function (known as executive control) and en- 
vironmental pressures, societal expectations and impulsivity, and the 
way in which the individual processes emotional, social and behav- 
ioural cues’? (Fig. 2). The impact of adolescent NMS disorders can be 
exacerbated by early exposure to risk factors that are more prevalent in 
LMICs (for example, malnutrition, infectious disease, abuse, neglect, 
internal conflict or war, early pregnancy and marriage, and adolescent 
labour). Consequently, adolescents in LMICs may be at a higher risk 
than their counterparts in high-income countries. 

The recent Well-Being of Adolescents in Vulnerable Environments 
study used identical methods to study adolescents from disadvantaged 
areas in five cities: Baltimore, Shanghai, Delhi, Ibadan and Johannes- 
burg. Neighbourhood physical and social exposures were associated 
with adolescent disorders in LMICs and high-income countries, al- 
though patterns and degree of association varied. The authors con- 
cluded that “for young people growing up in poverty, residency in a 
high-income country may matter far less than the immediate social 
contexts within which they develop and grow”. There is also evi- 
dence that in settings where socioeconomic disparities are great that 
depression is especially high compared with places where poverty is 
almost universal. Such findings illustrate the need for longitudinal 
and cross-sectional multi-country studies using comparable methods 
across differing levels of economic and social development. 


PREVENTION OF NMS DISABILITY 

An overview of research priorities for prevention of NMS disability is 
presented in Supplementary Table 1. Adolescents and young adults ex- 
perience the highest amounts of violence, both towards self (self-harm 
and suicide) and others (partner violence, delinquent behaviour, crim- 
inal acts and war-related trauma)?. In addition, adolescents are at in- 
creased risk of a range of health problems that can lead to NMS disability 
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BOX 1| HUMAN RIGHTS AND ADOLESCENT 
RESEARCH 


Health condition 
(disorder or disease) 


os vue al ~—— 
& structures 
Environmental Personal 
factors factors 


Two major international conventions and a new classification 
system transformed opportunities to support optimal development 
for children and adolescents living with neurological, mental health 
and substance-use disorders and for the primary prevention of 
disabling disorders. The Convention on the Rights of the Child 
(which includes adolescents) and the Convention on the Rights 

of People with Disabilities hold nations responsible for ensuring 
optimal developmental outcomes for children and adolescents and 
guarantee rights for those living with disabilities. These rights also 
commit countries to support adolescents in social inclusion and 
participation in society, and reduce stigma. 


The International Classification of Functioning, Disability and 
Health for Children and Youth (ICF-CY) takes account of the 
interaction between the biology of a given individual and the 
specific external challenges or supports affecting an individual's 
ability to function. It also acknowledges societal structures 
(environmental factors) that decrease or increase barriers to 
participation and inclusion”’. The ICF-CY, outlined in the Figure, 
relies on concepts of disability rather than disorder-specific 
pathology or morbidity and incorporates a socioecological 
framework. This highlights the importance of environmental 
context alongside genetic risk as a child develops through 
adolescence to adulthood. 


such as HIV/AIDS, meningitis, and trauma due to civil and interpersonal 
violence, as well as unintentional injury such as road traffic accidents’. 


Gender inequality, education, pregnancy and violence 
Increases in gender inequality during adolescence are evident in many 
LMICs, with a higher proportion of boys than girls completing second- 
ary school. This directly results in a substantial gender gap in education 
and literacy rates? and indirectly in differential opportunities for the 
development of executive function, self-regulation and employabili- 
ty. Intimate partner violence (IPV) against women also peaks during 
adolescence and is a known risk factor for acquiring HIV as well as 
brain trauma, post traumatic stress disorder (PTSD) and depression. 
The WHO multicountry study found that in women aged 15-24 who 
have ever had a romantic or sexual partner, the prevalence of IPV ex- 
perience before the age of 15 ranged from 19% to 64°%, with most sites 
reporting that this occurred in more than 50% of participants. There is 
little information on NMS outcomes in adolescent women experiencing 
IPV in LMICs, although in high-income countries, PTSD, depression 
and substance misuse are frequently associated with IPV™. Girls are at 
higher risk of HIV and AIDS, whereas boys are more at risk of civil and 
interpersonal violence’. 
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Adolescent mothers in high-income countries are particularly at 
risk of developing depression: prenatal, postnatal and, in one study, 
long-term depression”, but we found no research on the impact of ad- 
olescent pregnancy on depression in LMICs. The impact of adolescent 
pregnancy on maternal cognitive development has not been studied in 
any context and represents a global research priority. This is especially 
relevant in LMICs where 20°% of mothers have had their first child by 
the age of 18 and more than 20% of girls are married before age 18, 
despite a ban based on international conventions". 


Community and conflict-related violence and injury 
Adolescence is a dynamic time of biological (brain and sexual matu- 
ration) and social development (differentiating self from others and 
appraising the self by forming self-control, self-esteem and self-effi- 
cacy). Adolescents are concomitantly more likely to break social rules 
and experience aggression towards self and others’. The peak age of 
both committing and receiving violent acts is between 14 and 19 years 
old — adolescent males are more frequently the perpetrators and vic- 
tims than are females’. 

In wars, despite international agreements and conventions to the 
contrary, child soldiers are pressed into service at a young age, expe- 
riencing disrupted social and emotional development and interrupted 
schooling. These adolescents are at risk of physical trauma with last- 
ing disability, PTSD and addiction'*. Similar outcomes are found from 
the impact of post-conflict displacement of adolescents, who are often 
separated from their family and community”. However, there is evi- 
dence that negative impacts can be mitigated by stable and support- 
ive social environments. A study of 880 Bosnian adolescents demon- 
strated the need for better methodological approaches to unpack the 
multitiered pathways that lead to trauma in order to develop better 
interventions’. A review of interventions providing mental health 
support to conflict-affected children and adolescents found that most 
were school-based programmes”. There were few family- or commu- 
nity-based programmes and only two multilevel programmes. A sys- 
tematic review of interventions for refugee children found eight studies 
in LMICs (seven in refugee camps) that showed that treatments, in- 
cluding cognitive behavioural therapy (CBT) and narrative exposure 
therapy, were successful in reducing psychological problems such as 
depression and PTSD”. 

There is limited evidence that interventions developed in high-in- 
come countries can be transplanted successfully~?’. It has been ar- 
gued” that the most effective strategies for preventing violence should 
promote mental health and be delivered to all young people, thus 
avoiding stigmatization and attracting broad community support. 
Yet, the evidence base for effective programmes to prevent violence 
in LMICs is almost non-existent*°. A recent review of interventions to 
support street children found 12 studies in high-income countries, but 
did not find any adequately robust interventions in LMICs despite the 
plethora of programmes that are being implemented”. 

Road traffic accidents and other transport-related injuries are the 
leading cause of brain trauma and spinal cord injury, death and dis- 
ability, disproportionately affecting young adults and accounting for 
about 5°% of all DALYs, which is only exceeded by unipolar depression®. 


Mental health and substance use 
Difficulties in the development of executive control in adolescence can 
lead to a lack of balance in the regulation of cognition, emotions and 
behaviour when dealing with intrusive negative thoughts and feelings. 
This may result in depression or anxiety. Such affective disorders often 
have their onset in adolescence (Fig. 1) when these disorders are more 
likely to become severe and disabling’. The prevalence of depression 
in adolescent girls is substantially higher globally than that in boys’. 
There is some evidence from high-income countries that early inter- 
vention to reduce the duration of first episodes of depression can re- 
duce later recurrence”; this needs to be evaluated in LMICs. 

The first onset of schizophrenia often arises in adolescence or 
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early adult life (Fig. 1). Early identification of psychosis by a range of 
non-specialists followed by intervention may mitigate the severity of 
the disease and the need for hospitalization”. Family approaches have 
also shown promise for young adults at high risk??. However, differ- 
ences between high-income countries and LMICs in risk factors for, 
and the effective prevention and treatment of, psychoses are poorly 
understood”. 

Adolescents can be at increased risk of developing substance-use 
disorders owing to poor self-control and impulsiveness. Although not 
well studied in LMICs, substance-misuse is often initiated in adoles- 
cence and accounts for a substantial proportion of the disability burden 
faced by adolescents*". It also contributes to other major causes of dis- 
ability such as unintentional injuries and violence’. Alcohol use, par- 
ticularly binge drinking, has been shown to inhibit neuronal develop- 
ment in adolescence” and has a greater impact on motor and executive 
impairment in adolescents than in adults, thus conferring greater risk 
of injury or risky behaviour in adolescents*®. 

In Russia, excessive alcohol consumption is a major cause of pre- 
mature death*™ and is associated with early initiation and frequent al- 
cohol consumption in adolescence*. Evidence-based interventions to 
delay onset of drinking and reduce binge drinking in adolescents in 
LMICs are a high priority. A related priority calls for the inclusion of 
measures of depression, violence and sexually transmitted diseases in 
research aimed at diminishing the impact of either early initiation of 
drinking or of binge drinking in adolescence. 


Infectious diseases 

Vaccine-preventable neurological disorders may result in cognitive 
disability, epilepsy, motor disorders, and hearing and/or vision loss. 
For example, meningitis A is prevalent in sub-Saharan Africa and tar- 
gets adolescents and young adults, causing death, cognitive disabil- 
ity and hearing loss. WHO and PATH have developed a new vaccine 
against meningitis A that has been introduced in a number of African 
countries. More than 100 million doses have been delivered to people 
between 1 and 29 years of age, contributing to the lowest incidence of 
the disease in 10 years**. The use of implementation science to further 
scale-up this intervention would eliminate a major cause of hearing 
loss and cognitive disability in African adolescents. 

Untreated HIV infection is associated with disabling cognitive 
impairment, depression and behavioural disorders in adolescence*’. 
Following the scale-up of antiretroviral (ARV) therapy, survival of 
perinatally infected children in sub-Saharan Africa has dramatically 
improved. The population of adolescents with HIV is now estimated at 
2 million with more than 90% of those living in sub-Saharan Africa. 
Young adults infected with HIV are at increased risk of developmental 
and neuropsychological disturbances, which seriously undermine ac- 
ademic and social achievement?”**. A study in South Africa evaluated 
a community participatory approach to adapt a family-based inter- 
vention (originally developed in the United States) to promote mental 
health awareness in adolescents receiving HIV ARV treatment?’. Short- 
term results showed improvements in mental health, behaviour and 
adherence to the drug regimen with a decrease in stigma experience’. 
Peer-led programmes are now part of many large-scale initiatives to 
reduce HIV risk in young adults, and a systematic review found that 
they are effective in increasing knowledge, particularly of HIV preven- 
tion approaches and transmission routes and in increasing condom use 
in LMICs, but evidence of changes in sexual behaviour and STI rates 
are not conclusive’. Research to determine the effectiveness, gener- 
alizability and the long-term impact of such interventions to support 
adolescent development is a priority. 


Research in developing educational initiatives 

An important challenge to adolescents’ development in LMICs is the 
low rate of secondary education completion’. A review of competen- 
cies needed by vulnerable young adults as a result of war, homeless- 
ness and child labour concluded that these are not the same as those 
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conventionally required in the education of adolescents in high-in- 
come countries, but posits that the process by which they are acquired 
is universal'*. Research is needed for the development, evaluation and 
delivery of necessary competencies in secondary education in LMICs. 
Such innovative approaches would add to standard interventions to 
improve health, mental health and social outcomes. Such programmes 
would focus not only on the acquisition of skills and knowledge, but 
also on the development of cognitive and motivational skills that are 
central to the emergence of self-regulated learning. 


ADOLESCENTS WITH DISABILITIES RESEARCH 

There are few studies that evaluate the approaches that support LMIC 
adolescents living with disabilities. A systematic review of 22 school- 
based interventions in LMICs found promising support for mental 
health promotion and some evidence that interventions to prevent and 
to treat mental health disorders are effective*’. The study identified 
a series of research gaps that need to be filled to understand how to 
further evaluate and bring to scale these school-based interventions. 
These include the need to focus on current stressors such as social ex- 
clusion and domestic violence, and to test interventions as part of a 
multilayered societal programme with an emphasis on task shifting 
(use of trained mid-level researchers in place of clinical specialists). 

A priority area for research is to develop and evaluate programmes 
to ensure that adolescents living with an NMS disability are not exclud- 
ed from access to social, educational and health services that would re- 
duce secondary and tertiary development of disability and ensure opti- 
mal development. Interventions that increase independence must also 
be developed and tested to protect adolescents from violence, and to 
ensure educational parity and an effective transition to adult services". 


METHODOLOGICAL ISSUES 

It is often challenging to include adolescents in research. With the in- 
creasing independence of adolescents, their parents may not have ac- 
curate information regarding exposures and even outcomes. The home 
may no longer be a feasible setting for any study that requires either a 
physical examination or biological samples. However, recent studies 
have shown that providing support to carers and communities that as- 
sist vulnerable adolescents facing HIV infection can mitigate obstacles 
in reaching such groups*”’. Studies based in schools are often used, 
although such approaches may only work for younger (10-13 year olds) 
populations because most LMICs adolescents are no longer in school. 
According to UNICEF? only 61% of boys and 49°% of girls entered sec- 
ondary school in LMICs. One successful approach to reach vulnerable 
or marginalized urban adolescents is respondent-driven sampling’. 


Disaggregation of age group 

Researchers should collect data to allow disaggregation by age to fa- 
cilitate, at low additional cost, age-specific estimates of exposures 
and outcomes. Often research data on adolescents are combined 
with either children or adults*. A systematic review, including stud- 
ies from LMICs and high-income countries, demonstrated that the 
highest prevalence of intellectual disability was among children and 
adolescents compared with adults, and within LMICs compared with 
high-income countries’. However, it was impossible to disaggregate 
adolescents from children or adults. As an example of the importance 
of disaggregation*’, a peer support-group intervention to reduce the 
impact of stigma in those living with epilepsy in Zambia found a sig- 
nificant effect in adolescents, but not in adults. 


Promoting and funding longitudinal studies 

Major birth cohort studies in high-income countries have provided in- 
formation on risks and resilience across the life course, but these are 
not comparable for LMICs. There are several long-standing birth cohort 
studies in LMICs such as those in Guatemala, India, Brazil, the Philip- 
pines and South Africa**. The cohorts in such studies should be support- 
ed over the life course and the studies should be further developed to 
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BOX 2 | RECOMMENDATIONS FOR RESEARCH 


Recommendations for methodological priorities for research to 

reduce neurological, mental health and substance-use (NMS) 

disorders in adolescents in low- and middle-income countries 

(LMICs). 

e Disaggregate adolescent age group in both child and adult 
studies 

e Promote, develop and fund longitudinal cohort studies in LMICs, 
including observational and follow-up of interventions over the 
long term 

e Initiate cross-nation studies of interventions to promote 
resilience in adolescents 

e Include constellations of exposures and outcomes in studies that 
involve adolescents and avoid focusing on a single exposure or 
outcome 

e Use methodological approaches to explore multisectoral and 
multilevel pathways leading to NMS disorders in traumatized 
young adults 

e Include measures of disability and participation in studies of 
adolescents 

e Use new screening measures that include adolescence 
developed for national studies (UNICEF and Washington Group 
on Disability Statistics) 

e Link development of research priorities to emerging findings 
from national studies 

e Train mid-level researchers (nurses, teachers and others) to 
conduct research, including assessment of NMS disorders where 
shown to be effective 


High impact priorities for research to reduce NMS disorders in 

adolescents in LMICs. 

e Develop and evaluate strategies to provide mental health 
interventions for adolescents in community and school settings 
and address gaps 

e Develop an evidence base for interventions designed to prevent 
violence in young adults 

e Investigate the impact of early child bearing on adolescent 
mothers on development of executive function 

e Develop innovative approaches to provide appropriate 
secondary education for all adolescents 

e Develop innovative approaches to ensure inclusive education for 
those living with disabilities 

e Develop and evaluate approaches to protect those living with 
disabilities from violence and to ensure participation in their 
communities and cultures 

e Support evaluation of interventions to support the transition 
between adolescent and adult life for those living with NMS 
disabilities in LMICs 


address the research questions that are central to preventing or reduc- 
ing the impact of adolescent NMS disorders and their consequences. 


Broadening outcomes measures 

Since there is a constellation of exposures that put adolescents at risk 
of NMS disorders (many of which may simultaneously be NMS out- 
comes of other exposures), research into NMS disorders in adolescence 
should, ideally, not focus on a single exposure or outcome, but should 
measure multiple factors, whether exposure or outcome. In order to 
avoid too narrow a sectoral focus, researchers should include a summa- 
ry measure of disability in addition to measures specific to their study 
question and consider environmental exposures across education, so- 
cial development and health, using the framework of the International 
Classification of Functioning, Disability and Health for Children and 
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Figure 2 | The interaction between ‘top down’ executive function and ‘bottom up’ impulsive and emotional processes in regulating social, cultural and biological chal- 


lenges of adolescent development with a range of outcomes? ”. 


Youth (ICF-CY)4’. To accomplish this cross-cultural age-appropriate 
measures of disability, environmental risk and social participation 
need to be developed and validated to assess the balance between the 
developing abilities of the adolescent within the context of the barriers 
faced in their own communities and culture. 

Developing research capacity to measure adolescent NMS out- 
comes beyond mortality and conventional morbidity measures is es- 
sential. UNICEF and the Washington Group on Disability Statistics col- 
laborated to extend internationally appropriate screening measures of 
disability to include the adolescents. The WHO has developed a child 
and adolescent form of the brief WHO Disability Assessment Sched- 
ule**, which is designed to provide information regarding the function 
of young adults living with disabilities, including NMS disorders. This 
schedule uses the ICF-CY to provide information on external supports 
and barriers, and societal participation. It is a priority to include these 
measures in national surveys and in research studies, to provide both 
comparative data and population estimates of adolescent disability, 
which can inform national strategies. 


CONCLUSION 

Because of the range of environmental challenges that are more com- 
mon in LMICs, adolescents are at particular risk of developing neu- 
rocognitive deficits and disabilities, and mental health problems that 
limit their reasoning abilities, life-management skills and employ- 
ability. These are often related to the poor development of executive 
function or socioemotional development caused by traumatic brain in- 
jury due to war or interpersonal conflict, domestic violence or abuse, 
substance use, or infectious diseases. These largely unmeasured NMS 
disorders cannot be prevented or treated by an under-resourced health 
and educational infrastructure. Therefore, it is imperative to develop 
neurodevelopmental and psychiatric screening strategies to identify 
and assess these young adults. These will make it possible to evaluate 
the effectiveness of prevention and early intervention programmes for 
NMS disorders. Along with screening and clinical assessment for neu- 
rocognitive and neuropsychiatric disorders in adolescents at high risk, 
it is crucial to provide comprehensive access to effective rehabilitative 
neurocognitive and psychosocial interventions. A scaffold of universal 
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supports for those at high risk of NMS and with disability will also re- 
quire evaluation of more intensive interventions for those with signif- 
icant disorders. 

The recent extension of screening measures to include NMS disabil- 
ity in adolescents for use in national or regional surveys provides new 
opportunities for prioritizing research and directing funding streams 
in LMICs, and provide the data on which programmes and policies can 
be based. 

Adolescence is not simply a phase between childhood and adult- 
hood. Research to prevent and ameliorate NMS disorders in LMICs must 
bring an understanding of the interlocking neurobiological and social 
factors that challenge young adults, and recognize that the patterns of 
risk and resilience may differ from those in high-income countries. 
Research must investigate approaches to prevent toxic exposures such 
as community violence, sex trafficking, drug addiction, poverty and 
the harm caused by war. Research should develop and evaluate inter- 
ventions to prevent the emergence of disability from NMS disorders 
in adolescents in LMICs, as well as to develop and test interventions 
to support adolescents living with disability to ensure appropriate 
schooling, employment and the opportunity to become productive 
members of society, with a rewarding quality of life. 
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The resiliency of the adult nervous system is markedly affected by the environment and the circumstances during infant and 
child development. As such, adults in resource-limited settings who may have experienced early deprivation are particularly 
vulnerable to subsequent neurological disorders. Adult populations in countries with relatively recent advances in economic 
development may still have a higher susceptibility to neurological illness or injury that is reflective of the socioeconomic environ- 
ment that was present during that population’s infancy and childhood. Brain and peripheral nervous system research conducted 
over the past decade in resource-limited settings has led to an impressive and growing body of knowledge that informs our un- 
derstanding of neurological function and dysfunction, independent of geography. Neurological conditions feature prominently 
in the burgeoning epidemic of non-communicable diseases facing low- and middle-income countries. Neurological research 
in these countries is needed to address this burden of disease. Although the burden of more prevalent and severe neurological 
disease poses public health and clinical challenges in settings with limited neurological expertise, the same factors, along with 
genetic heterogeneity and the relative absence of ingrained clinical care practices, offer circumstances well-suited for the con- 


duct of crucial future research that is globally relevant. 
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eurological disorders represent a significant proportion of the 

burden of disease among adults in low- and middle-income 

countries (LMICs). Stroke, epilepsy and dementia rank among 
the highest causes of death and disability, and often affect work- 
ing-age adults. High rates of premature death and disability undoubt- 
edly constrain societies that are striving to advance human develop- 
ment through improvements in health and a more stable economy. 
Research aimed at elucidating the pathophysiology and epidemiology 
of adult nervous system disorders across a broad spectrum of settings 
has advanced our capacity to develop interventions at the population 
and individual level. Insights gained from developed countries have 
provided remarkable evidence that deprivation in early life imparts ad- 
ditional health-related vulnerabilities that extend into adulthood. This 
added susceptibility is an important and generally unmeasured factor 
that could account for the poor health status among LMICs adults. The 
burden of non-communicable diseases continues to escalate in LMICs 
that are still struggling with infectious diseases and maternal health 
challenges; the need to identify opportunities for optimizing health in 
older populations is increasingly urgent. In this Review, we detail an 
overview of some of the insights gained from research in diverse aca- 
demic, laboratory and community settings and consider what knowl- 
edge can be gained from investments in future endeavours. 


EARLY EXPOSURE AND THE ADULT NERVOUS SYSTEM 
Neurological disorders of infancy, childhood and adolescence are dis- 
cussed elsewhere in this collection (see page S161), but if survived, 


neurological injury in early life may result in adult disability, vulner- 
ability to disease and premature mortality. Therefore, it is crucial to 
integrate a lifespan approach as we examine what we currently know, 
as well as research priorities for the future. Central nervous system 
injuries, infections and infestations during childhood that result in 
epilepsy, behavioural disorders and intellectual disabilities lay the 
foundation for a compromised adulthood. Furthermore, the life course 
and exposures of childhood determine adult neurological health, with 
childhood deprivation and adolescent injury among the primary risk 
factors for adult neurological disorders in LMICs. For example, child- 
hood risk factors for epilepsy in sub-Saharan Africa include labour 
and delivery complications and lower maternal education levels*. Data 
from 1921 and 1936 birth cohorts in Aberdeen, UK, suggest that early 
nutritional deprivation predisposes people to dementia in later life°. 
Rheumatic heart disease in childhood, which is largely a result of poor 
access to health care and services, is a common cause of stroke among 
young adults in Southeast Asia and Africa*. 

Although there is a well-established association between pov- 
erty and vulnerability to neurological disorders®, shockingly little is 
known regarding the pathophysiological processes that drive such 
links — even for exposures as ubiquitous as childhood malnutrition’®. 
Given the substantial effect that childhood health and the environ- 
ment (in the broadest sense) have on adult neurological health, lon- 
gitudinal population-based studies of countries that have (and have 
not) met key United Nation’s Millennial Development Goals are war- 
ranted. In the future, these experiments in poverty reduction over 
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relatively short time periods may also offer opportunities to formally 
evaluate well-developed theories regarding epigenetic responses to 
environmental change and the fetal origins of adult neurological con- 
ditions, particularly life-long cognitive impairment, and later onset 
dementia’. 

Although malnutrition remains the principle nutritional challenge 
in child health in low-income countries, childhood obesity is reach- 
ing epic proportions in middle-income regions. Prevalence rates in 
South Africa (15%), Mexico (25%) and Brazil (23%), reflect the avail- 
ability of relatively cheap, high-energy foods and increasingly seden- 
tary lifestyles. The ongoing childhood obesity epidemic has resulted 
in unprecedented prevalence rates of hypertension among children 
and adolescents!’. High body mass index, hypercholesterolemia and 
hypertension in childhood are associated with higher carotid artery 
intima-medial thickness in young adulthood — an established risk 
factor of cardiovascular disease and stroke!!. Moreover, childhood 
hypertension is associated with hypertension as an adult”. Other 
adult cardiovascular risk factors include those that occur early in life 
with long-term consequences. Passive tobacco exposure in childhood 
causes lower high-density lipoprotein cholesterol’ and childhood 
smoking is associated with greater carotid intima-medial thickness as 
an adult, even if the smoking habit ends before maturity". Although 
exposure to established stroke risk factors from childhood undoubt- 
edly increases an individual’s overall risk of adult cerebrovascular dis- 
ease, longitudinal studies are needed to quantify the impact of today’s 
childhood obesity problem on the adult population of tomorrow. 


LATER LIFE DISEASE AND ADVANCES IN NEUROSCIENCE 
Although risk factors for adult neurological disorders can occur in isola- 
tion in childhood, if problems such as obesity, tobacco use and hyperten- 
sion begin in childhood or adolescence they often continue into adult- 
hood. Cardiovascular risk factors and illiteracy play a major part in the 
growing burden of dementia in LMICs®. Without successful programmes 
aimed at poverty reduction and population-based interventions that 
curb harmful behaviours and encourage healthier lifestyles, the global 
prevalence of neurological disorders is likely to increase substantially. 

The growing body of neurological research from LMICs has contrib- 
uted substantially to, and at times transformed, our understanding of 
numerous neurological disorders, or the pathophysiological processes 
surrounding them. These contributions include observational'® and ep- 
idemiological®> studies, health-services research’’, and clinical trials'*, 
as well as laboratory-based sciences”. Many of these advances address 
neurological disorders that are common in both developed and less-de- 
veloped settings”®. Other contributions are focused on disorders more 
limited in geographical scope”, but these too offer advances on which 
future research can build. 


Advances in neuroscience basic sciences 

The mysteries of neurodegeneration are being revealed through re- 
search collaborations between US investigators and LMIC scientists. 
Researchers from the University of Antioquia in Colombia have shown 
that viral vectors for RNA interference of cyclin-dependent kinase 5 
can reduce neurofibrillary tangles in transgenic Alzheimer’s disease 
mice”, and ongoing investigations are aimed at identifying human 
genes that modify the age at onset of Alzheimer’s’. Researchers at 
New York University and the University of Uruguay are using salmo- 
nella~based Afs to develop a vaccine for Alzheimer’s*. Argentinian 
investigators are also using recombinant adenoviral vectors and have 
shown that insulin-like growth factor (IGF) I gene therapy can restore 
hypothalamic dopaminergic function in senile rats*, and that IGF also 
improves motor function’. Collaborative studies often between re- 
searchers from multiple countries are contributing to our understand- 
ing of the genetic mechanisms of Parkinson’s disease and stroke’. 
Further research of this nature is ongoing through the National Insti- 
tutes of Health’s H3Africa Initiative with a particular focus on stroke 
genomics in Africa. 
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Epidemiological studies. Epidemiological studies are well represented 
among those that contribute to our understanding of the global burden 
of nervous system disorders. Often initial studies offer more questions 
than answers, meaning subsequent investigations are required. For 
example, previous epidemiological estimates from developing regions 
persistently illustrated a ‘gap’ in epilepsy prevalence versus the 
lifetime prevalence rates, indicating that either people with epilepsy 
in developing countries have a higher remission rate or that they have 
higher rates of premature mortality than their counterparts in wealthier 
settings. As part of the International League against Epilepsy’s China 
demonstration project, a prospective study of people with epilepsy in 
rural China demonstrated high rates of prime adult, seizure-related 
mortality despite readily accessible treatment”’, indicating that this gap 
is from mortality and not disease remission. Multinational collaborations 
currently underway are elucidating the neurotoxicity of pesticides on 
both passively exposed children and adult agricultural workers”. 
Although combined antiretroviral therapy is facilitating the transition 
of HIV from a fatal disease to a chronic condition, neurological disability 
remains a significant problem for people with HIV worldwide*’. HIV-as- 
sociated neurocognitive disorders (HANDs) are evident in almost half of 
those with HIV in the United States*. An international group working on 
the clade-specific effects of HIV on the central nervous system has gained 
important insights into the pathophysiology and prevention of HANDs. 
Studies in Africa and Latin America identified that HIV-associated neu- 
ropathy prevalence rates in those countries are substantially greater than 
in similar US populations, and point towards a central role for nutritional 
factors*. The studies clearly illustrate the need for further investigations 
aimed at directing future interventions. Interventions to optimize the 
cognitive outcomes for people with HIV in regions heavily affected by 
the disease will be needed to further limit the impact of HIV on human 
resources in LMICs. 


Implementation science. Research across the life course is needed for 
neurological disorders, but translating the knowledge from research 
into action requires evidence-based approaches to implementation 
that are locally relevant. For example, researchers studying food-re- 
lated neurotoxicity have successfully identified safer, feasible food 
preparatory measures, which have been taken up by communities*®. 
Epilepsy is one of the commonest chronic neurological conditions 

in all settings regardless of economic status. The epilepsy commu- 
nity has called for improved screening and integrated care for the 
common psychiatric co-morbidities that often devastate people with 
epilepsy. Barriers to developmental and interventional studies that 
would facilitate this undertaking include time constraints, limited 
reimbursement and understaffing**. Epilepsy researchers in Zambia 
collaborating with faculty at the country’s only psychiatric hospital 
have developed and validated a screening instrument for anxiety and 
depression that is feasible in their setting and appropriate for the peo- 
ple seeking care”. Prior to its implementation, less than 1% of people 
receiving epilepsy care were treated or referred for psychiatric care. 
After implementation, this number increased to 32% (ref. 17). Despite 
the recognition that the psychiatric and social morbidity of epilepsy 
contributes substantially to the burden of those affected, a recent 
systemic review found that few rigorous intervention studies aimed 
at decreasing this social morbidity have been conducted*>. Local peer 
support groups in Zambia have been shown to decrease epilepsy-as- 
sociated stigma and improve medication adherence among young 
people**. This work was accomplished in an environment with time, 
resource and staffing constraints. The approaches taken may offer 
guidance to researchers from higher income regions. 


Diagnostic advances. Rigorous clinical investigations require the de- 
velopment or adaptation, and validation of instruments for categoriz- 
ing and quantifying disease and disability. One of the chief barriers to 
research in LMICs is the availability of such instruments. In the past 
decade LMIC-based researchers have undertaken instrument devel- 
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opment for neuropsychiatric disorders and dementia in China; neuro- 
disabilities in Africa*’; HIV-associated dementia and neuropathies in 
Kenya** and Zambia®; and stroke in Vietnam*’, Argentina*!, Portugal’? 
and Nigeria*®. Methodology and software have been developed to 
facilitate the use of neuroimaging to characterize and quantitatively 
describe novel conditions“ that could prove important in this era of 
emerging infectious diseases. 


Clinical trials. Although clinical trials conducted in LMICs might seem 
to be the study design that is least likely to inform care in high-income 
regions, a clinical trial addressing the use of invasive intracranial 
pressure monitoring conducted in Brazil and Ecuador has forced 
neurosurgeons and neurointensivists to reconsider long-held ideas of 
the value of such assessments”. Conducting clinical trials in settings 
that are less burdened by ingrained practice patterns and training 
conventions may offer crucial opportunities to collect the data that 
will allow us to examine whether common practices established in the 
absence of evidence are really warranted. 


OLDER ADULTS TO OLDEST OLD 
Frailty and functional disability often afflict old age. Degenerative dis- 
eases that affect vision and hearing compound health problems in older 
people. But these are not inevitable characteristics of ageing. Neurolog- 
ical diseases of old age, for which substantial literature exists in LMICs, 
include stroke, dementia, Parkinson’s disease and epilepsy. The quality 
of life of those affected is often poor and major depression may occur*®. 
Stroke is the most important cause of disability and a leading cause of 
death globally. A third of stroke patients in resource-limited settings die 
and another third are left with residual disability. The toll on economic 
productivity in these low-income countries that are poorly equipped to 
care for these individuals is considerable. According to the World Health 
Organization, stroke accounts for 55° of the disability-adjusted life 
years (DALYs) for neurological disorders, whereas Alzheimer’s disease 
is responsible for 12.0% (ref. 47). Currently, stroke is the most impor- 
tant cause of hospital admission in many African countries and this has 
been adduced to the high frequency of untreated hypertension. Late 
presentation in hospitals, poor facilities for adequate care provision and 
prevention of complications contribute to the high mortality rate. The use 
of thrombolytic agents is limited because of exorbitant cost and late pres- 
entation in hospitals. Data on the risk factors for stroke have come from 
the INTERSTROKE study, which included participants from five coun- 
tries in sub-Saharan Africa (Mozambique, Nigeria, South Africa, Sudan 
and Uganda). The frequency of intracerebral haemorrhage is relatively 
high in Africa compared with other regions; and the strongest association 
between hypertension and stroke was also reported in African partici- 
pants (odds ratio, 4.96 (95% confidence interval, 3.11-7.91) versus 2.79 
(95% confidence interval, 1.83-4.75) for developed countries. Factors 
associated with poor prognosis are impaired consciousness, swallowing 
problems, incontinence and chest infection based on data from Gambia**. 
Stroke will remain a problem unless urgent action is taken in health pro- 
motion and the provision of facilities for optimal care within the resourc- 
es available. Stroke units can provide crucial support for meeting some of 
these challenges and are available in some tertiary care facilities’. 
Dementia is the most common neurodegenerative disease that af- 
fects older adults and the oldest old. Data on disease burden is sparse in 
sub-Saharan Africa because of the relatively young population and the 
suspected concealment of cases within families*°. The prevalence in Af- 
rican communities varies between 2.29% and 10.1%, although direct 
comparison of results between studies may be hampered by differences 
in the methodology used. The rates are higher in South American coun- 
tries*', in East Asia, the Pacific and South Asia relative to the level of eco- 
nomic development*’. According to Alzheimer’s Disease International, 
58% of those affected live in LMICs, where awareness is considered to 
be poor®. Alzheimer’s is the predominant type of dementia, accounting 
for between 50% and 80% of cases'>. Other types are vascular dementia, 
Lewy body dementia and frontotemporal lobar degeneration. 
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BOX 1 | STUDIES CONDUCTED IN LMICS THAT 
INFORM RESEARCH IN HIGH-INCOME SETTINGS 


e Studies of genetic risk factors for bipolar | disorder have the 
potential to improve our understanding of these factors in 
immigrant populations in the United States™. 

Proposed studies of stroke prevention in sickle cell disease in 
Africa® and studies of cognition and/or psychosis in familial 
populations”’ in Mexico can directly affect the health of 
Americans with these disorders. 

Proposed studies of the impact of HIV1 viral diversity on 
cognition and most studies evaluating the neurological and 
behavioural outcomes associated with environmental toxins 
or exposures would not be feasible in the United States, but 
the results of these studies provide crucial insights into health- 
related concerns directly applicable to US populations. 
Research on outcomes of guideline-based management of 
increased intracranial pressure (ICP) rather than invasive ICP 
monitoring’ would not have been possible in the United States 
given the absence (appropriate or not) of equipoise among 
clinicians and entrenched practice patterns. 

Resource challenges facing research conducted in LMICs 
stimulate the development of more cost-effective diagnostics 
such as the tripolar concentric ring electrode based non-invasive 
transcutaneous focal stimulation for epilepsy”. 


‘The risk factors for dementia are similar to those in western coun- 
tries’. An intriguing observation was the lack of association between 
apolipoprotein E ¢e4 allele and Alzheimer’s in sub-Saharan Africa. 
Furthermore, a recent publication from the Indianapolis-Ibadan group 
reported significantly lower rates of dementia in Africa compared with 
ethnically similar populations in the United States*. This finding could 
indicate a paradigm shift in dementia risk and could have a bearing on 
rising dementia burden as a consequence of changing dietary habits. 
Vascular factors have also been shown to increase dementia risk. Hy- 
pertension after the age of 65 increases the risk of incident dementia in 
elderly Yoruba Nigerians**. A novel presenilin 1 mutation was found to be 
associated with Alzheimer’s in a South African family and was charac- 
terized by early-onset presentation’. Other types of dementia associated 
with neurodegenerative diseases, including frontotemporal dementia, 
probably exist in LMICs'. 

Social isolation is linked with cognitive decline owing to limited 
cognitive stimulation. The extended family system that has so far pro- 
vided a buffer for the care of older people in LMICs is being eroded by 
rural-urban migration and economic pursuits**. Furthermore, institu- 
tionalized care is frowned on in these countries because of the stigma 
of destitution. Carers face many problems, notably psychological stress 
and high rates of depression; carer distress, therefore, is an emerging 
problem. To obviate this, a model of care for LMICs is a creche-type ser- 
vice for those with dementia. Weight loss is another source of concern 
as this may be a pointer to the development of dementia®’. Intervention 
studies in regions where institutionalized care for older people is not an 
established norm may offer insights for high-income settings. 

Dementia is a public health priority and the use of medication is 
limited in LMICs; consequently, intensive efforts should be direct- 
ed at identifying the risk factors and making interventions to stem the 
tide. Education and lifestyle changes (for example, the consumption of 
healthy diets, physical activity and engagement in community activities 
to reduce social isolation) can go a long way to preventing dementia. Mild 
cognitive impairment (MIC) is the intermediate stage between normal 
ageing and dementia. It is classified into amnestic and non-amnestic 
types, and each type can affect single or multiple cognitive domains”. 
Single-domain amnestic MCI is a precursor of Alzheimer’s, and it is the 
most common. The prevalence of MCI varies between 10% and 40% in 
studies from Africa*!'. Vascular cognitive impairment follows stroke 
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with white matter lesions and medial temporal atrophy, and manifests 
as impairment of executive functions’. Important risk factors for MCI 
include age, hypertension, dyslipidaemia, illicit drug use and metabolic 
disorders. HIV infection also causes mild neurocognitive dysfunction 
and should be investigated based on clinical judgement. 

Parkinson’s disease is the most common movement disorder that 
affects older people and has both motor and non-motor manifestations. 
Dementia and depression are common co-morbidities. In Guam, Parkin- 
sonism is associated with Alzheimer’s and amyotrophic lateral sclerosis. 
Exposure to neurotoxins such as cycasin seems to be important in disease 
aetiology; oxidative stress also plays a part®. Many neurotoxins have been 
implicated in tropical myelinopathies, including cyanogenic glycosides 
present in poorly fermented cassava® and (§-N-oxalyl-amino-L-alanine 
present in the chickling pea (Lathyrus sativus; also known as the grass 
pea), which if consumed during drought causes lathyrism*’. These neu- 
rotoxins have not been implicated as risk factors for Parkinson’s disease. 

Symptomatic epilepsy in our global ageing population is redefining 
the epidemiology of a condition previously considered to be a disorder 
of children and young adults. In older people, the important aetiologi- 
cal factors are stroke, space occupying lesions, metabolic derangements, 
neurodegeneration and medication side effects. Neurocysticercosis and 
onchocerciasis are two tropical diseases that cause epilepsy in LMICs and 
should be looked for and treated. Prescribing cheap, cost-effective med- 
ications with minimal side effects and using simplified regimens could 
reduce the challenge posed by a treatment gap, which leaves most people 
with epilepsy in resource-limited settings untreated. 


INSIGHTS GAINED FROM WORK IN LMICS 


This review of a decade of research into brain disorders in LMICs illus- 
trates the rich diversity of disorders, geographical areas and popula- 
tions studied. The studies used a variety of methods, approaches and 
study designs, ranging from qualitative research to clinical or epide- 
miological approaches, and to translational and basic science projects. 

Although each study addresses issues of importance to brain dis- 
orders that are relevant to the host country, the body of research also 
has relevance to high-income populations. Global health research can 
inform our understanding of the health of all populations and improve 
health care in high-income settings by exploring the health of key un- 
derserved populations, such as recent immigrants or ethnic minorities; 
enabling research of disorders with high morbidity, but low preva- 
lence, in higher income settings; empowering researchers to challenge 
existing paradigms; identifying approaches or interventions with bet- 
ter cost-effectiveness than existing standards of care; and identifying 
best practices with superior outcomes. 


OPPORTUNITIES TO BUILD ON RECENT ADVANCES 

A decade of research has established a solid foundation on which to 
continue to build research collaborations and a body of knowledge 
about brain disorders in the developing world across the life course. 
Translating these scientific advances into individual or popula- 
tion-based prevention or treatment interventions that improve brain 
health is a crucial next step to achieve better health for individuals af- 
fected by brain disorders worldwide. 


FUTURE DIRECTIONS 
Essentially all research programmes are conducted in a resource-limited 
setting because of finite budgets, human resources and research infra- 
structure. Every programme undertaken comes at the cost of the opportu- 
nity for another research programme, development activity or investment. 
As such, some principles for priority setting might be considered. Specific 
programme content and priorities may vary regionally and development of 
programmes should be undertaken through partnerships in which LMIC 
partners direct the content and purpose on the basis of local needs. 

The impressive body of work emanating from LMICs and collabora- 
tions with researchers from these countries illustrates the value of such 
international partnerships to all involved (Box 1). Future research needs 
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Table 1 | Research priorities to address the neurological burden of disease in 
low- and middle-income countries 


Priorities 


Implementation studies 
of interventions or 
programmes found 

to have efficacy or be 
effective on a small scale 


Intersection of 
neurological NCDs and 
infectious and/or MCH 
conditions that remain 
problematic in LMICs 


Maximize use of existing 
data 


Example 


Screening and treatment for 
depression and anxiety among 
people with, or mothers of 
those with, epilepsy 

Evaluate varying systems of 
dementia care for health, 
health-related quality of life 
and cost outcomes 


Stroke, epilepsy, cognitive 
impairment in people living 
with HIV/AIDS 

Life-course evaluation of the 
impact on adult health of 


Potential approach 


. Scale up of developed 
programmes with 
multifaceted, 
community-based 
assessments 


° Randomized studies at a 


community level 


° Accessing populations 
for study through 


established HIV services 
° May require birth cohort, or 


acomparison of long-term 


childhood malnutrition and outcomes for countries 

obesity that did and did not meet 
Millennial Development 
Goals 


Further elucidate the burden 
of neurological diseases 
within the framework 
needed for health-service 
delivery, treatment and 
secondary prevention 


° Re-framing of GBD 2010 
data to quantify relative 
burden of neurological 
disorders as manifested 
rather than as viewed 
through the root cause 


° Databases are available 
that are not focused on 
neurological health, but 
which capture relevant 
exposures and outcomes 


Study new disorders that 
may offer opportunities 
to gain understanding 

of mechanisms for 
neurological disease 

or injury that are more 
broadly relevant 


Clinical trials of 
potentially affordable 
interventions informed 
by insights gained in 
high-income setting 
and aimed at improving 
neurological outcomes 


Nodding syndrome and 
konzo 


Avoidance of hyperthermia 
after neurological injury 
informed by post-arrest and 
post-HIE cooling protocols 
Possible inclusion of 
aggressive antipyretics 
rather than overt cooling 


° Requires collaborations 
with clinicians, 
epidemiologists and 
basic scientists 


. Pragmatic clinical trials 


GBD, global burden of disease; HIE, hypoxic ischemia encephalopathy; LMICs, low- and middle-income 
countries; MCH, maternal and child health; NCDs, non-communicable diseases 


to continue to address the shared burden of common conditions, in- 
cluding neurodegenerative disorders, stroke and epilepsy (Table 1). For 
neglected tropical diseases such as cysticercosis, rabies, trypanosomia- 
sis and leprosy, methods developed for the study of rare and orphaned 
conditions (most of which are neurological) could be applied to these 
diseases. Although epidemiological and translational research across the 
spectrum of diseases and methodologies is still needed in resource-lim- 
ited settings, an increased focus on interventional studies, such as clinical 
trials or population-based interventions is crucial. Interventions that are 
culturally relevant and feasible to implement in resource-limited set- 
tings, and which can be translated into effective policies that ultimately 
lead to improved health at both the individual and population levels are 
the essential next step and need to be prioritized to take full advantage of 
the body of knowledge generated so far. 
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Global research challenges and opportunities 
for mental health and substance-use disorders 
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The research agenda for global mental health and substance-use disorders has been largely driven by the exigencies of high 
health burdens and associated unmet needs in low- and middle-income countries. Implementation research focused on con- 
text-driven adaptation and innovation in service delivery has begun to yield promising results that are improving the quality of, 
and access to, care in low-resource settings. Importantly, these efforts have also resulted in the development and augmentation 
of local, in-country research capacities. Given the complex interplay between mental health and substance-use disorders, med- 
ical conditions, and biological and social vulnerabilities, a revitalized research agenda must encompass both local variation and 
global commonalities in the impact of adversities, multi-morbidities and their consequences across the life course. We recom- 
mend priorities for research — as well as guiding principles for context-driven, intersectoral, integrative approaches — that will 
advance knowledge and answer the most pressing local and global mental health questions and needs, while also promoting a 
health equity agenda and extending the quality, reach and impact of scientific enquiry. 
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he global mental health landscape has transformed over the past 

25 years because of the higher visibility of the burden of men- 

tal health and substance-use disorders'. These disorders com- 
prise 7.4% of global disability-adjusted life years (DALYs) and 22.7% 
of global years lived with disability (YLDs)? (Supplementary Informa- 
tion)’. The main contributors worldwide are depression and dysthymia 
(9.6% of all YLDs); anxiety (3.5% of all YLDs); and schizophrenia, sub- 
stance-use disorders and bipolar disorder (just over 2% of all YLDs). 
Alcohol and substance-use disorders come in second for most of the 
developing world, more so for southern Africa (drug use) and Eastern 
Europe (alcohol). The burden of mental health and substance-use dis- 
orders is predicted to increase worldwide in coming decades, and the 
steepest rise can be expected in low- and middle-income countries 
(LMICs) as a result of rising life expectancy, population growth and 
under-resourced health care*. For example, simulation models predict 
a 130% increase in associated health burden of alcohol and substance 
misuse in sub-Saharan Africa by 2050 as a result of population growth 
and ageing®. As substantial as they are, conventional health metrics do 
not capture additional social burdens attached to living with mental 
illness. Untreated mental health disorders are associated with a high 
economic burden’. Furthermore, pervasive stigma and human rights 
violations compound the suffering associated with these disorders and 
exacerbate social vulnerabilities’ °. 

As the health, social, economic and human costs of mental and 
substance-use disorders become increasingly better documented, 
political will and multilateral commitments to scale up mental health 
care in LMICs have grown. The World Health Organization has intro- 
duced a series of policy initiatives that articulate both high-level aspi- 


rations and pragmatic guidance for mental health and substance-use 
services delivery in LMICs. The most recent, the Global Mental Health 
Action Plan 2013-2020, challenges member states, partners and the 
Secretariat to collectively meet ambitious goals by the year 2020, 
including increasing mental health care coverage by 20% for severe 
mental health illness and reducing national suicide rates by 10% 
(ref. 10). A consideration of the timeline of these landmark events — 
including the roll out of anumber of key funding and policy initiatives 
that target the persisting resource gaps — illustrates the substan- 
tial momentum in integrating mental health into the broader global 
health agenda that has occured over the past few decades (See Sup- 
plementary Information). 

An interactive map, depicting the broad geographical distribution 
of current, promising initiatives in global mental health is available at 
http://www.nimh.nih.gov/responsive/map.shtml. Policy and pro- 
grammatic initiatives have laid a foundation for strengthened global 
mental health services by developing an initial consensus scientific 
agenda that focuses energies and funding on the most crucial research 
for building an empirical base. Key funding initiatives have support- 
ed research to leverage scarce resources and improve access through 
task sharing, integration of mental health care into existing primary 
health-care infrastructure and enhancement of diagnostic assess- 
ment. Increasingly, resources have been allocated to build in-country 
research capacities and strengthen collaboration through institutional 
partnerships". Complementary graduate-level training programmes 
in global mental health have also emerged, although mental health 
specialty training, as a track, remains relatively under-represented 
among other global health domains”, 
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Boston, Massachusetts 03115, USA. ‘Office for Research on Disparities & Global Mental Health, National Institute of Mental Health, 6001 Executive Boulevard, Room 
7207, Bethesda, Maryland 20892, USA. Correspondence should be addressed to F. B. e-mail: kamayonza@gmail.com. 
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Figure 1 | Burden of risk associated with substance-use disorders varies across the world as disability adjusted life years (DALYs). Reprinted with permission from ref.4. 


Key research gaps and challenges 

The global health burden of mental health disorders is exacerbated 
by the growing concurrent problems associated with substance mis- 
use. Substance use and exposure to addictive drugs have chronic and 
profound effects on neurobehavioural and neurodevelopmental func- 
tions. In LMICs, the socioecology of poverty, malnutrition, political 
conflicts and poor health systems influence the epidemiology, as well 
as the adverse outcomes, that result from substance misuse. Addi- 
tional challenges associated with co-morbidity stem from its aug- 
mentation of clinical burden, through increased risk for relapse, other 
infectious and medical complications, and economic hardship and 
homelessness. In this context, co-morbid substance use and mental 
illnesses in particular may contribute to increasing health burden. The 
prevalence of substance-use disorders has escalated in recent dec- 
ades, reaching 5.4°% of the total disease burden and 9.1°% when tobac- 
co use is included. 

Individuals with substance-use disorders are also likely to have 
other mental health problems, including depression and schizophre- 
nia‘!>, Similarly, a large proportion of people with mental illnesses 
also have substance-use disorders'*!”. Research that investigates the 
relationship between mental illness and substance-use disorder has 
yielded mixed findings, with some support for causal relationships in 
both directions as well as for shared genetic, environmental, social and 
cultural risk factors. For example, cannabis use is linked to a risk of 
developing psychotic illness'*. Conversely, mental illness may increase 
the risk of substance misuse; individuals may ‘self-medicate’ with 
alcohol, tobacco or amphetamines as a means of coping with distress 
and negative affects!’*°. Some factors, including genetic vulnerabili- 
ties, traumatic exposures and stress, may confer risk for both condi- 
tions)”. Diagnosis and treatment of co-morbid substance misuse and 
mental health illness remains a significant challenge, particularly in 
LMICs. The burden of this co-morbidity is further exacerbated by the 
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increased clinical complexity that stems from resistance to treatment, 
risk of relapse, vulnerability to other infectious and medical complica- 
tions, and increased economic hardship and homelessness. 

The burden and configuration of risk associated with substance-use 
disorders and co-morbid mental illness seem to vary across the world 
(Fig. 1)“. Although alcohol and opioid problems are escalating in Eu- 
rope, Africa and Asia, problems associated with amphetamines and 
cannabis are more prevalent in Asia, North America and Europe. Co- 
caine use is prevalent in North America and Europe, whereas misuse of 
indigenous psychoactive substances is prevalent in other regions, such 
as the use of khat in parts of Africa and the Middle East and that of 
coca leaves in South America’. Notably, existing knowledge gaps may 
underestimate the impact of substance-use disorders”. The full extent 
of adverse mental health and social impacts of substance-use disorders 
such as alcohol use during pregnancy and fetal alcohol spectrum disor- 
ders remain incompletely understood. 

Mental health and substance-use disorders also frequently co-oc- 
cur with other diseases, increasing associated morbidity and mor- 
tality risk**?’, It is not uncommon for individuals with HIV/AIDS or 
non-communicable diseases such as hypertension, diabetes and car- 
diovascular disease to also have symptoms of depression or anxiety and 
to use alcohol or other drugs to excess. Attention deficit hyperactivity 
disorder has been associated with risky sexual behaviours that can re- 
sult in transmission of HIV/AIDS. These interdependent illnesses stem 
from common risk factors, such as childhood adversity; and bidirec- 
tional influences, such as poor treatment adherence” and increased 
engagement in risky behaviour” *!. Growing awareness of the complex 
interplay between mental illness and the increasing burden of chron- 
ic disease globally has prompted research that examines the effects of 
depression on adherence to medical treatments and the effects of inte- 
grated care — co-treatment of high blood pressure and depression, for 
example — on the outcomes of both of the co-occurring illnesses (see 
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for example refs 32-34). A life-course approach to risk reduction that 
takes into account risks that occur in childhood and early adulthood, 
and that promotes a healthy lifestyle, and early recognition and treat- 
ment of mental and substance-use disorders is essential to curtail the 
long-term negative impacts of many preventable health risks. 


TREATMENT GAP 

The proportion of people who need, but do not receive care is especial- 
ly high in LMICs*>"*. The inadequate resourcing of mental health care 
in LMICs has been widely documented and critiqued. For example, on 
average less than 3°% of public health resources are allocated to specif- 
ic mental health care in LMICs, with even less (around 1%) in Africa 
and Asia*’”. Most LMICs have far fewer health-care professionals than 
they need to deliver mental health and substance-use interventions 
to everyone who needs them**?. Scaling up services will require more 
than training additional psychiatrists, psychologists and psychiatric 
nurses, however, strategic leveraging of scarce resources will also be 
necessary. In particular, task shifting — delegating health-care tasks 
from specialists to various non-specialist health professionals and oth- 
er health workers — has shown promise for certain mental health and 
substance-use interventions* *’. In addition, the integration of men- 
tal health services into primary health-care delivery settings through 
community-based and task-sharing approaches can both help to 
reduce burden on carers and improve access and the coordination of 
care. Mental health services and health-system strengthening, and in 
particular, task shifting, as well as organization and ways of delivering 
community-based mental health services in LMICs should be prior- 
itized for research. 

There is also a substantial gap in scientific knowledge for prevent- 
ing and treating mental health and substance-use disorders. In addi- 
tion, what is currently known is often not applicable to low-resource 
regions. Intervention strategies to address substance-use disorders 
have improved over recent decades, but have had limited success in 
achieving total recovery and have limited coverage in LMICs'°. More- 
over, resources for providing these interventions are constrained or 
lacking in most LMICs*4, Models for improving availability and ac- 
cess to effective mental health care emphasize the integration of both 
prevention and treatment services within primary care systems. This 
has been a core approach taken by the WHO Mental Health Gap Action 
Program (mhGAP)**-45, 

Most published clinical trial data on therapeutics for mental health 
disorders are based on research conducted in high-income coun- 
tries**-**. In the absence of region-specific empirical data, deployment 
of these therapeutic strategies in LMICs is a reasonable pragmatic com- 
promise in the short term when informed by local knowledge, and 
pending rigorous and systematic evaluation. Local research on clinical 
effectiveness of these treatments and implementation research on how 
to deliver these therapies and scale them up are urgently needed. 


Priorities for advancing the global mental health agenda 

Our recommendations build on the strong base of empirical evidence 
and previous consensus statements and reports that have articulated 
principles, needs and priorities that should inform a robust research 
agenda (Table 1). The predominant focus of global mental health re- 
search is currently on health services and implementation research, 
areas that align well with efforts to close treatment gaps and that must 
continue to be strengthened. Whereas we regard these contributions 
as formative and arguably the most pragmatic and exigent in the short 
term, they should not pre-empt a more ambitious scope of scientif- 
ic inquiry that ranges from basic sciences to health policy. Innova- 
tion should encompass much more than strategies to leverage scarce 
resources lest the scope of progress in the field be consigned to im- 
proving the efficiency of old models of care delivery. Instead, comple- 
mentary and parallel lines of context-driven research should also aim 
to advance the scientific understanding of aetiology, population-spe- 
cific phenotypic variation in presentation and course, and differential 
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response to therapeutics through promising avenues in neuroscience, 
biomarkers, genetics and epigenomics. 


EPIDEMIOLOGY 

Epidemiological research is crucial to better understand the differential 
risk factors and burden of mental health and substance-use disorders 
across diverse geographical regions and social contexts. Refinement 
of approaches to diagnostic assessment that are both locally valid and 
relatable to global classification is essential to more effective and effi- 
cient case identification, particularly in the hands of non-specialists. 
Such advances will generate more accurate estimates of health burdens 
and salient risk factors on which local health policymakers can draw. 
In addition, research is needed to better define the health and social 
impacts of syndemic mental health disorders, substance-use disorders 
and medical diseases, as well as to understand how social adversities 
moderate and mediate risk of onset, severity and course. Such research 
will inform optimal strategies for prevention, treatment and follow-up 
care for individuals with these co-morbidities. 


BASIC SCIENCE RESEARCH 

The research agenda to address the global burden of mental health 
and substance-use disorders should build on recent advances in the 
field of basic neuroscience, biomarkers, proteomics, and genetics and 
epigenetics. For example, research in the past decade has identified 
molecular and structural markers connected with mental health and 
substance-use disorders*®. These include protein alterations in the 
form of upregulation of a 40-amino acid VGF-derived peptide and the 
downregulation of apoAl protein in schizophrenia®®. Hormonal and 
physiological alterations in stress- and appetite-related neuropep- 
tides have also been pursued in the context of addiction and treatment 
outcome *’. There has also been significant interest in epigenomics 
and how it could advance our understanding and use of biomarkers. 
Epigenomic modifications affect gene expression, and involve multi- 
ple molecular steps, including DNA methylation. In light of evidence 
that indicates a role for epigenetic mechanisms in modifying genes 
that increase propensity for drug use and mental illness, it is impor- 
tant to develop a means by which this approach could be harnessed to 
improve the validity and reliability of diagnostic measures as well as 
to help to tailor interventions to the individual. Research that consid- 
ers the diversity of environmental exposures and gene-environment 
interactions across different settings can advance the utility of these 
markers to confirm diagnosis and to predict treatment outcome. Fur- 
thermore, such markers may be useful in identifying those at high risk 
so that measures can be applied to prevent initial risk or onset, or slow 
down or prevent progression towards psychopathology. The use of such 
approaches should also parallel the development of conceptual mod- 
els to guide understanding of the complex, multidimensional aetiol- 
ogy of mental health and substance-use disorders. To that end, global 
research that focuses on mental health and substance-use disorders 
should take into account how genetics and exposure to environmental 
toxins interact with social, cultural and environmental conditions to 
moderate the risk of these disorders. 


HEALTH DELIVERY AND IMPLEMENTATION RESEARCH 

Four out of the top five research priorities identified in the grand chal- 
lenges statement — developed by a consortium of researchers, advo- 
cates and clinicians with funding from the US National Institute of 
Mental Health (NIMH) and the Global Alliance for Chronic Diseases — 
fall in the domain of enhancing the quality of, and access to, mental 
health care. This call to invest in health services and implementation 
research is in response to identified treatment gaps as well as their nu- 
merous antecedents, such as weak health systems, shortfalls in human 
and financial resources, and social structural barriers to care. There is 
ample evidence for science-based care and the integration of mental 
health services into primary health care. However, we still lack crucial 
knowledge on how best to disseminate and implement evidence-based 


19 November 2015 | 7578 | 527 


BAINGANA ET AL. | MENTAL HEALTH AND SUBSTANCE-USE DISORDERS 


Table 1| Summary of priority research areas and guiding principles for approaches to reduce the global burdens of mental and substance-use disorders. 


Research domain 


Priority areas 


Epidemiology ° 


Refine approaches to diagnostic assessment across diverse local contexts by enhancing their local validity within universal frameworks, and promoting and 
evaluating their clinical utility for non-specialist health professionals and other health workers, including in a wide variety of health-care and community-based 
settings 

Improve accuracy of estimates of health and related social and economic burdens 

Advance understanding of local contextual factors that mediate and moderate risk, course of illness, and recovery for mental and substance-use disorders by 
identifying synergies among co-morbid mental health and substance-use disorders, and medical conditions relating to risk, illness trajectories and treatment 
outcomes; understanding context-specific impacts of social adversities and local resources that promote resilience; illuminating social vulnerabilities that 
impede access to care; examining and comparing population-specific phenotypic variants of major mental health and substance-use disorders to enhance 
understanding of genetic, developmental, environmental, social and cultural contributions to aetiology, course and presentation 

Evaluate health, social and economic benefits of general health, mental health and substance-use disorder therapeutic intervention, prospectively across the 


life course and the next generation 


¢ Advance understanding of gene-environment interactions with the scientific benefits that variation across diverse contexts provides 
¢ Identify moderators of course of illness, therapeutic response, remission, relapse and recovery 


Health-care delivery and ¢ Optimize effectiveness of task-sharing models for community-based case-finding and treatment delivery 
implementation 


¢ Understand and resolve barriers to integration of mental health and substance-use disorder assessment and treatment in primary care settings and in non- 


health platforms (for example, schools, housing and the criminal justice system) 


e Mitigate barriers to care access 


¢ Develop and improve coordinated approaches to strategic preventive interventions, monitoring and targeted treatments over the life course and across 


disorders 


¢ Evaluate successful delivery models at scale and their adaptation to diverse local contexts 
¢ Advance understanding of factors that mediate and moderate successful care delivery at scale 


Guiding principles 


Illuminate ‘sideways’ impacts of economic, housing, criminal justice and education policies on mental health and substance-use disorders 
Develop evidence-based strategies that accelerate the uptake of mental health research findings by policymakers 


Responsive to exigent needs related ¢ Consistent with the broad global health agenda of reducing health burdens, research recommendations focus on a globalizing framework and leveraging 


to burden of disease and associated 


treatment gaps 


understood 


most salient local research priorities 


Integrative across primary and i 
specialty clinical care domains and 


the life course these disorders 


education and other sectors 


Capacity building . 


strategy that can bring effective interventions to a larger scale 
¢ This pragmatic focus should not eclipse the complementary relevance and importance of extending research into comparative differences in mental and 
substance-use disorders across diverse populations and social contexts 


© Diversity across local social, economic, environmental, cultural and political contexts of health and health care has profound implications for risk, resilience, 
presentation, consumer demand, therapeutic outcomes, and impacts of mental and substance-use disorders; however, these factors remain incompletely 


¢ Engagement of local knowledge is an essential complement to universalizing frameworks and the aggregate global empirical base in developing a fully 
contextualized understanding and optimization of local responses to mental health and substance-use disorders as well as in formulating and pursuing the 


Mental disorders, substance-use disorders and medical conditions are frequently co-morbid and thus development, evaluation and implementation of delivery 
models that enhance coordination and integration of care — across disorders, conditions and the life course — are likely to reduce the burden and impacts of 


¢ Improved links between scientific evidence and policymaking can enable and accelerate translation from research to practice 
¢ Cross-sector planning can promote efficiencies and achievement of health sector goals by anticipating necessary preliminary or parallel actions in finance, 


Research activities should encompass an agenda and plan for in-country capacity building to grow the global research work force, extend the quality and reach 


of scientific enquiry, and promote a health equity agenda 
¢ Capacity-building should be bidirectional and collaborative, when possible, to tap complementary expertise and experience in research, clinical practice, 
implementation, policy and other relevant dimensions that will advance the local and global mental health agenda 


mental health interventions in resource-poor contexts, including 
those characterized by the extreme social adversities associated with 
political conflict, displacement and destitution. Future research is 
therefore necessary to rigorously evaluate and optimize effectiveness 
of task sharing, integration of mental health into primary care, and 
deployment of the mhGAP algorithms at larger scale and across diverse 
social settings*+*°. 

Key strategies for expanding access to high-quality mental health 
care in LMICs come from models that are successful in leverag- 
ing scarce resources in other clinical domains. However, challenges 
unique to care delivery for mental health and substance-use disor- 
ders warrant special attention and innovation. These include how to 
improve diagnostic assessment and population health surveillance, 
given the heterogeneous and sometimes opaque presentations of signs 
and symptoms across diverse social and cultural contexts*”**; how to 
address the social and cultural factors, especially stigma, that hinder 
access to care and may prevent patients with mental illnesses and sub- 
stance-use problems from using the resources available for prevention 
and treatment; how to mitigate social vulnerabilities, such as poverty 
and gender-based violence that elevate risk of mental disorders, while 
building on sociocultural resources that promote coping and resilience; 
how to develop coordinated approaches to strategic preventive inter- 
ventions, monitoring and targeted treatments over the life course and 
across disorders, given developmental trajectories of mental health 
and substance-use disorders, and their harmful symbiosis with other 
chronic conditions and vulnerabilities; and how to rapidly scale up ef- 
fective interventions to close the treatment gap in resource constrained 
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environments“. Priorities for global mental health research resonate 
with the global health agenda, with its focus on reducing health bur- 
dens. In this respect, a globalizing framework aimed at developing 
approaches that are effective when scaled up and implemented across 
geographically and socially diverse settings and populations reflect 
pragmatic goals of responding to pressing needs. We emphasize, how- 
ever, that closing the prevailing treatment gaps for mental health 
and substance-use disorders will also depend on fortifying scientific 
inquiry so that we can understand the, sometimes remarkable, local 
variation in manifestation and course of mental disorders”. 


TRANSLATIONAL AND HEALTH-POLICY RESEARCH 

Ensuring that populations receive high-quality care that improves 
mental health is the purview of policymakers. Shaping sound public 
policies that are based on up-to-date research can be challenging, 
but promising examples exist. An experimental housing policy called 
Moving to Opportunity found that moving from a high-poverty to a 
lower-poverty neighbourhood improved adult physical and mental 
health and subjective wellbeing over 10-15 years, despite no change 
in average economic status“. Moving to Opportunity was able to capi- 
talize on the fact that public policy decisions are interconnected — it is 
not just health policies that influence mental health, substance use and 
other public health outcomes, but also economic, housing and crimi- 
nal justice policies, among others®. Rapid growth in mental health and 
substance-use research over the past decade, as well as appeals from 
researchers and advocates to apply the findings in policy and practice 
have not yet bridged the divide between what is known and what is 
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done®*’, The intricacies of ensuring evidence-based health policy 
are not entirely understood®, but a few effective practices are being 
used. Advocacy organizations such as the National Alliance on Mental 
Illness have become trusted sources of digestible research findings’. 
Carefully planned links between researchers and decision makers 
— an approach increasingly encouraged by funders of mental health 
and substance-use research — can also be effective®’. Such links of- 
ten involve collaboration among researchers, government agencies, 
advocates and provider institutions to synchronize research activities 
with policies, health-care demands and community priorities, and 
to engage key stakeholders in the identification of pressing research 
questions and the use of study findings. In this way, policymakers have 
become partners in the research enterprise, helping researchers to un- 
derstand what information is needed for developing or updating poli- 
cies, making investment decisions, expanding access to care, improv- 
ing care quality and monitoring system-level change over time. The 
long-term goal is that these partnerships will mobilize political will, 
inform policy development, and shed light on the essentials of shaping 
science-informed mental health and substance-use policies. 
Inclusion of mental health as an explicit priority in the post-2015 
development agenda (such as that included in the UN Open Working 
Group on Sustainable Development Goals, 2015; https://sustaina- 
bledevelopment.un.org/content/documents/7891TRANSFORM- 
ING%200UR%20WORLD.pdf) provides an opportunity to mobilize 
the requisite political will and resources at several levels so that this 
ambitious agenda for research and capacity building can be realized. 
Lessons learned from the positive health impact as a result of Millen- 
nium Development Goals 4, 5 and 6 illuminate how multisector and 
multilevel cohesion of effort and commitments are powerful levers for 
advancing health in low-resource settings, and an opportunity for the 
broad community of stakeholders and advocates to improve care for 
individuals living with mental health and substance-use disorders. 


COLLABORATIVE CAPACITY BUILDING 

New commitments and additional resources will be needed to rapid- 
ly cultivate the in-country research capacity needed to respond to the 
global disease burden of mental health and substance-use disorders”. 
The most culturally sensitive, scientifically and ethically sound, and 
locally relevant research requires investigators who best understand 
and live among the populations that they study. Funding initiatives 
such as the Fogarty International Center’s Global Brain and Nervous 
System Disorders Across the Lifespan programme (http://www.fic. 
nih.gov/Programs/Pages/Brain-Disorders.aspx), the NIMH’s Col- 
laborative Hubs for International Research in Mental Health (http:// 
www.nimh.nih.gov/about/organization/gmh/globalhubs/ index. 
shtml) as well as Grand Challenges Canada’s Global Mental Health 
granting programme (http://www.grandchallenges.ca/grand-chal- 
lenges/global-mental-health/) that explicitly structure research ca- 
pacity building into grant requirements, provide exemplary platforms 
to test and ultimately to systematize innovative strategies for training, 
mentorship and building a research culture and other infrastructural 
support for research in LMICs. 

In addition, collaborative capacities to advance the mental health 
and substance-use research agenda must be developed. Capaci- 
ty building in knowledge management is also integral to packaging 
accrued evidence so that it is accessible to policymakers and men- 
tal health technology specialists in LMICs. Platforms for knowledge 
sharing (for example, the Mental Health Innovation Network, http:// 
mhinnovation.net/; and GHD Online, http://www.ghdonline.org/) 
can promote scientific discovery and help to harmonize the mental 
health and substance-use disorder research goals, processes and tools, 
and to catalyse the translational potential of research to policy and pro- 
grammes”. Moreover, these platforms are needed to build and consoli- 
date the community of advocates, consumers, investigators, clinicians 
and policymakers united in their commitment to mitigate the suffering 
associated with mental health and substance-use disorders, eliminate 
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their attendant stigma, diminish their social and economic burdens, 
and erase the social and health disparities perpetuated by poor access 
to high-quality mental health care. 


CONCLUSIONS 

The formidable and rising health, economic and social burdens associ- 
ated with mental health and substance-use disorders call for the pri- 
oritization of research that can inform a global response — through the 
development and enhancement of preventive and therapeutic strat- 
egies, health-system strengthening and policymaking — to alleviate 
suffering and stem the associated economic and social consequences 
of unmet needs. Indeed, the potential synergies among breakthroughs 
in basic neuroscience, epidemiological methods and implementation 
science, as well as the mobilization of resources and political will have 
generated optimism and catalysed a commitment to act among poli- 
cymakers, advocates and the scientific community. Although the in- 
crease in mental health research initiatives over the past two decades 
are encouraging for the future challenges remain and patterns of pro- 
gress have been inconsistent. We find, for example, that although re- 
sponse to the growing burden of depression in LMICs has led to an in- 
crease in the number of studies on effectiveness of treatments, delivery 
methods and task shifting to provide access to care for all populations, 
we do not see this same trajectory of efforts to address substance-use 
disorders. This occurs with the background of growing substance-use 
problems globally. Approaches to address substance-use disorders in 
LMICs are still limited, fragmented and not well vetted scientifically or 
culturally. On an optimistic note, the draft Social Development Goals 
to be passed by the UN General Assembly in September 2015 recognise 
mental health as integral to health and mental health is explicitly in- 
cluded within universal health coverage; in addition, the UN General 
Assembly will hold a special session on drugs in 2016. These develop- 
ments have symbolic and substantive importance, and auger well for 
mental health within the Global Health agenda in the coming years. 
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Global research priorities for infections that 
affect the nervous system 
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Infections that cause significant nervous system morbidity globally include viral (for example, HIV, rabies, Japanese encephalitis 
virus, herpes simplex virus, varicella zoster virus, cytomegalovirus, dengue virus and chikungunya virus), bacterial (for example, 
tuberculosis, syphilis, bacterial meningitis and sepsis), fungal (for example, cryptococcal meningitis) and parasitic (for example, 
malaria, neurocysticercosis, neuroschistosomiasis and soil-transmitted helminths) infections. The neurological, cognitive, behav- 
ioural or mental health problems caused by the infections probably affect millions of children and adults in low- and middle-in- 
come countries. However, precise estimates of morbidity are lacking for most infections, and there is limited information on the 
pathogenesis of nervous system injury in these infections. Key research priorities for infection-related nervous system morbidity 
include accurate estimates of disease burden; point-of-care assays for infection diagnosis; improved tools for the assessment 
of neurological, cognitive and mental health impairment; vaccines and other interventions for preventing infections; improved 
understanding of the pathogenesis of nervous system disease in these infections; more effective methods to treat and prevent 
nervous system sequelae; operations research to implement known effective interventions; and improved methods of rehabili- 
tation. Research in these areas, accompanied by efforts to implement promising technologies and therapies, could substantially 


decrease the morbidity and mortality of infections affecting the nervous system in low- and middle-income countries. 
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ecent improvements in the detection of infectious organisms 

that can affect the nervous system has led to the realization that 

a substantial proportion of chronic neurological, cognitive and 
behavioural disease may actually have an acute and preventable origin. 
Infectious organisms may infect the nervous system directly, as in ra- 
bies and bacterial meningitis, or may cause neurocognitive disorders in 
the absence of direct infection of the nervous system, as in malaria or 
hookworm infection. This Review identifies global research priorities 
for infections that affect the nervous system, with the ultimate goal 
of stimulating research in these priority areas to substantially reduce 
morbidity associated with nervous system infections worldwide. 


METHODS 

For this Review, we chose illustrative infections that cause consid- 
erable nervous system morbidity in children and adults in low- and 
middle-income countries (LMICs). These infections are examples and 
are not meant to be exhaustive. Estimates of infection global frequency 
and types of nervous system involvement were obtained through Pu- 
bMed searches using the infection name and were accompanied by any 
of the following terms: neurologic, nervous system, cognition, cogni- 
tive, development, neurodevelopment, impairment, deficit, sequelae, 
brain injury, brain damage, mental health, behavioral or neuropathy. 
If available, World Health Organization (WHO) documents were also 
reviewed for each disease. The authors came to a consensus on the key 


research priority areas, on the basis of a literature review and research 
experience. 


INFECTIONS AFFECTING THE NERVOUS SYSTEM 

The global distribution, frequency and types of neurological, cognitive 
and mental health disorders associated with key infections are pre- 
sented in Table 1. For classification purposes, infections are reviewed 
according to type of microorganism (virus, bacteria, fungus or par- 
asite) in the sections that follow. However, microorganisms within a 
group (for example, the viruses HIV and rabies) can affect the nervous 
system in as varied a manner as microorganisms of different groups 
(for example, the virus HIV and the malaria-causing parasite Plasmo- 
dium falciparum). 


Viral infections 

Worldwide, rabies and Japanese encephalitis virus (JEV) are respon- 
sible for an estimated annual mortality of 60,000 and 17,000 people, 
respectively. Cases of JEV encephalitis are restricted to Asia, whereas 
rabies is a scourge throughout Southeast Asia, Africa and Latin Ameri- 
ca and occurs, although less frequently, in other areas worldwide. Ra- 
bies and herpes simplex virus (HSV) encephalitis, which is also present 
worldwide, lead to high mortalities without treatment’. JEV has var- 
iable mortality, depending, in part, on the infected individual’s age. 
Among survivors, long-term cognitive or neurological impairment is 
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present in as many as 70% of those with HSV encephalitis* and 30- 
50% of those with JEV encephalitis’. Most cases of JEV infection (as 
opposed to encephalitis) are asymptomatic or mildly symptomatic and 
require no treatment®, but the long-term neurocognitive consequences 
of asymptomatic or mildly symptomatic JEV infections are unknown. 

Varicella zoster virus (VZV), the cause of chickenpox, can, like other 
herpesviruses, establish a latent infection. Reactivation is usually owing 
to suppression of cell-mediated immunity, most commonly age-related 
immunosenescence. Central nervous system (CNS) reactivation is rela- 
tively uncommon, but reactivation in a dorsal root ganglion can lead to 
herpes zoster, which is associated with debilitating chronic pain’. Her- 
pes zoster is the single most common infection of the nervous system, 
with an estimated one million new cases each year in the United States 
alone’. There is little data on nervous system VZV infection in LMICs. 

Congenital cytomegalovirus (CMV) infection is the most com- 
mon acquired cause of hearing loss in children in the United States’. 
The percentage of the population testing positive for CMV is higher in 
LMICs than in high-income countries, but the incidence of congeni- 
tal CMV infection and of symptomatic CNS disease and hearing loss in 
most LMICs remains almost unknown”. 

Dengue, and to a lesser extent chikungunya, viruses will probably 
become leading global causes of arboviral encephalitis in the next dec- 
ade?!_ Although encephalitis is present in only a fraction of those in- 
fected with dengue virus, the large number of dengue infections world- 
wide could lead to hundreds of thousands of encephalitis cases (Table 1). 


HIV and opportunistic infections 

In 2012, an estimated 2.3 million people were infected with HIV, and 
1.6 million died of AIDS-related illnesses worldwide’’. HIV-associated 
neurological syndromes are classified as primary HIV infection, sec- 
ondary or opportunistic infection, and treatment-related neurological 
disease. Most often, primary HIV infection causes acute aseptic (viral) 
meningitis or meningoencephalitis (MEC). HIV-associated neurocog- 
nitive disorder (HAND) is a neurodegenerative condition characterized 
by cognitive, motor and behavioural abnormalities that is becoming 
more common with an increase of HIV in people older than 50 years". 
In LMICs, where only one third of patients requiring highly active an- 
tiretroviral therapy (HAART) receive it, the opportunistic infections 
cryptococcal meningitis, tuberculous meningitis, cerebral toxoplas- 
mosis, progressive multifocal leukoencephalopathy and CNS cytomeg- 
alovirus infection remain common”. 

The widespread implementation of combination antiretroviral 
therapy (cART) has changed the presentation, manifestation and ep- 
idemiology of many conditions and opportunistic infections, owing 
to immune reconstitution syndrome and increased prevalence of cog- 
nitive impairment and neuropathy with additional morbidities in pa- 
tients who are now living longer"®. The epidemiology and neurological 
outcomes of HIV infection are also affected by underlying malnutrition 
and variations in endemic pathogens. 

Cryptococcal meningitis is a leading cause of mortality in LMICs 
where access to antiretroviral therapy is limited!’. Most cases occur in 
sub-Saharan Africa, followed by South and Southeast Asia'®. Seroprev- 
alence for Toxoplasma gondii in people with HIV ranges from 10% to 
80°% with the highest proportions in African countries”, and cerebral 
toxoplasmosis is the most common cerebral mass lesion in patients 
with AIDS. 


Bacterial infections 
The most common bacterial infections affecting the nervous system are 
sepsis and meningitis in neonates; bacterial meningitis due to Strepto- 
coccus pneumoniae, Haemophilus influenzae type b and Neisseria menin- 
gitidis in children and adults; and tuberculous meningitis in children 
and adults. 

Neonatal meningitis and neonatal sepsis are associated with long- 
term neurological and cognitive impairment”; primarily impairment 
of hearing, vision or motor function; cerebral palsy; and epilepsy. In 
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LMICs, it is estimated that 23°% of neonates who have survived men- 
ingitis sustain moderate to severe neurodevelopmental impairment. 
Staphylococcus aureus, Gram-negative infections, including Escherichia 
coli, Klebsiella pneumoniae, Acinetobacter, non-typhoidal Salmonella 
and group B Streptococcus, are the leading causes of neonatal sepsis 
and meningitis in most LMICs” 4. Recent reports from Africa and India 
suggest an alarming increase in drug resistance among Gram-negative 
organisms infecting neonates**>. 

In children and adults in high-income countries, bacterial men- 
ingitis due to S. pneumoniae, H. influenzae type b and N. meningitidis 
has decreased dramatically following immunization with conjugate 
vaccines**. However, availability of these vaccines in LMICs is vari- 
able, and bacterial meningitis still affects 1.2 million individuals an- 
nually”*, causing neurocognitive sequelae in 23°% of affected children. 
Steroids as an adjunctive therapy have reduced neurological sequelae 
in high-income countries, particularly in adults, but have shown no 
benefit in LMICs?’. Factors such as organism strain, co-infections, ad- 
junctive and supportive therapies, and underlying conditions such as 
poor nutrition that can affect immune response often differ between 
high-income countries and LMICs, and may play a part in the different 
results seen in some clinical trials. 

Tuberculous meningitis (TBM) occurs in around 1% of all cases of 
tuberculosis, but is the most severe form of extrapulmonary tubercu- 
losis, resulting in death or severe disability in about 50° of those with 
the disease”*. Bacterial meningitis, and particularly TBM, can result in 
hydrocephalus, which is often difficult to treat in LMICs because neu- 
rosurgeons are typically not available and the supplies for ventriculop- 
eritoneal shunt placement are often not present or very limited. 

The WHO estimated that approximately 10.6 million new cases of 
syphilis occurred in 2008 (ref. 29), but precise estimates of the inci- 
dence of neurosyphilis are not available. 


Parasitic infections 

Malaria in humans is caused by one of five Plasmodium species, but 
neurological disabilities are most frequently associated with Plasmodi- 
um falciparum. Although P falciparum does not directly infect brain tis- 
sue, severe infection can lead to coma. One in four children with cer- 
ebral malaria develops long-term cognitive impairment*’, and recent 
studies suggest that children with severe malarial anaemia also have 
long-term cognitive impairment*!. Behavioural problems and epilepsy 
are other long-term consequences of cerebral malaria”. Children with 
repeated episodes of uncomplicated malaria have motor and cognitive 
problems**. The mechanisms by which malaria leads to neurocognitive 
problems are not fully defined, and the neurocognitive burden of ma- 
laria owing to other Plasmodium species has not been characterized. 

Neurocysticercosis, which is endemic in areas with poor pig man- 
agement practices and sanitation, occurs when the larval stages of 
Taenia solium infect the brain. In LMICs in which neurocysticercosis is 
endemic, it is the leading identified cause of seizures. The proportion 
of larval infections migrating to the brain is unknown, but some indi- 
viduals with neurocysticercosis never show neurological symptoms™. 
Neurocysticercosis can be fatal, most often following complications of 
surgery to treat the hydrocephalus associated with intraventricular or 
subarachnoid neurocysticercosis, but overall mortality estimates have 
been difficult to obtain®. 

Neuroschistosomiasis results from the migration of schistosome 
eggs or worms to the brain or spinal cord, and may occur following 
infection with Schistosoma japonicum, Schistosoma mansoni or Schisto- 
soma haematobium. Brain involvement may occur in the acute phase 
(acute schistosomal encephalopathy) or in the chronic phase (cere- 
bral schistosomiasis or pseudotumoral encephalic schistosomiasis). 
The spinal cord may also be involved, often leading to hemiparesis”. 
CNS symptoms and epilepsy are reported to occur in 2.6% and 2.1% 
of S. japonicum infections, respectively*’. Neuroschistosomiasis can be 
fatal, especially in its tumour-like form when it affects the cerebellum, 
but accurate mortality rates are unavailable*. 
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Table 1 | Neurocognitive and mental health consequences of major infectious diseases that affect the nervous system. 


Infectious 
disease 


Regions 
affected 


Arboviruses 


Estimated prevalence 
or annual incidence of 
infection 


Health consequences 


Neurological 


Cognitive 


Mental health 


Global, most 
common 

in South 
Asia, Africa 
and Latin 
America 


Dengue virus 


Chikungunya virus | Global, most 


390 million (95% Cl, 
284-528 million) 


33,000-93,000 


Meningitis, meningoencephalitis, encephalitis, seizures, 


Guillain-Barré syndrome, neuralgic amyotrophy, 
hypokalaemic paralysis, and dengue myositis 


In one cohort, dengue had neurological manifestations in 


9.3% of children and adults 


There is limited information about long-term sequelae in 


dengue, but there is evidence of significant long-term 
neurological complications 


Encephalitis, febrile seizures, meningismus, myelopathy or 


Not studied 


Not studied 


Case reports of mania 
and depression 


Not studied 


common myeloneuropathy 

in South 

Asia, Africa 

and Latin 

America 
Japanese Southeast 35,000-50,000 CNS complications during the acute illness include Among survivors, 30-50% have Among survivors, 30— 
encephalitis Asia delirium, seizures, axial rigidity, extrapyramidal significant neurological, cognitive 50% have significant 

signs, cranial nerve palsies, ataxia, paraplegia and or psychiatric neurological, 
segmental sensory disturbances sequelae cognitive, or 
psychiatric 
sequelae 

Rhabdoviruses 
Rabies Global, 60,000 (probably an Severe encephalitis, which is almost 100% fatal Fatal Fatal 

greatest in underestimate) 

sub-Saharan 

Africa, 

Southeast 

Asia and 

Latin 

America 


Herpesviruses 


HSV encephalitis Global 


VZV Global 


Congenital Global 
cytomegalovirus 


Present in all countries 
where HSV testing has 
been performed, but no 
reliable global estimates 


No reliable global 
estimates 


0.6-0.7% of live births in 
high-income countries 
and 1-5% of live births 
in LMICs 


If untreated, as in most LMICs, there is a high fatality 
rate for HSV-1 (around 70%), lower (around 15%) if 
treated. Long-term neurological complications occur 
in around 70% of adult survivors, including seizure 
disorder and hemiparesis. In one cohort, neurological 
sequelae occurred in 63% of infections in children, 
including seizures in 44% and developmental delays 
in 25% 


CNS: stroke, meningoencephalitis, myelitis 
PNS (more common): herpes zoster with chronic pain 


Most common non-hereditary cause of hearing loss in 
children in the United States 

There are no reliable estimates for frequency of 
hearing loss due to the infection in most LMICs 


In one study of adult survivors, long- 
term cognitive sequelae included 
memory impairment (69%) 


Very limited studies with conflicting 
results 


Symptomatic infection, seen in 
10-15% of congenitally infected 
children, is associated with 
significant global developmental 
delay in around 50% of affected 
children 


Personality or 
behavioural 
impairment in 45% of 
adult survivors 


Major depression 


Behavioural problems 


HIV-related 

HIV Global, Annual incidence estimate HIV associated opportunistic infections, aseptic Asymptomatic neurocognitive Delirium, major 
greatest is 2.3 million (95% Cl, meningitis, AIDS encephalopathy, Bell’s palsy, impairment, mild neurocognitive depression, bipolar 
burden in 1.9-2.7 million) with 34 progressive multifocal leukoencephalopathy, primary disorder and HIV-associated disorder (including 
sub-Saharan | million people living with CNS lymphoma, stroke, transverse myelitis, HIV- dementia AIDS mania), 

Africa and HIV/AIDS worldwide, of associated peripheral neuropathy, inflammatory schizophrenia, 

Asia whom 23 million live in demyelinating polyneuropathy, immune substance abuse or 
sub-Saharan Africa and 3.5 reconstitution inflammatory syndrome and vacuolar dependence and 
million live in Southeast myelopathy post-traumatic stress 
Asia disorder 

Cryptococcal Global, Annual incidence Headache, meningismus, intracranial hypertension, Mimicking of vascular dementia, Personality change, 

meningitis greatest estimate: 957,900 in 2009, mental status changes, focal intracerebral and reversible dementia confusional psychosis 
burden in approximately 624,700 granulomas (cryptococcomas), hydrocephalus and mania 
sub-Saharan | deaths annually (communicating and non-communicating), 

Africa and papilledema, sensorineural deafness, cranial nerve 

Asia palsies, motor and sensory deficits, cerebellar 

dysfunction and seizures 
Toxoplasma Global, No reliable global Headache, focal neurological deficit, seizures and Dementia Schizophrenia and 
encephalitis greatest estimates of incidence of altered mental status behaviour disorders 
burden in toxoplasma encephalitis, 

sub-Saharan | but 

Africa and toxoplasma infection 

Asia is present in 14% of the 


population in the United 
States, compared with 
23-47% in some European, 
Latin American and 
African countries 
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Neonatal sepsis 
and meningitis 


Bacterial 
meningitis 


Tuberculous 
meningitis (also 
an opportunistic 
infection in HIV) 


Neurosyphilis 


Global 


Global 


Global, most 
burden in 
sub-Saharan 
Africa and 
Asia 


Global 


Annual incidence 
estimates for south Asia, 
sub-Saharan Africa and 
Latin America: neonatal 
sepsis, 1.7 million 
(uncertainty estimate, 
1.1-2.4 million); neonatal 
meningitis, 200,000; 95% 
Cl, 21,000-350,000) 


Annual incidence 
estimate: 1.2 million 


No reliable global 
incidence estimates; 
highest in countries with 
high prevalence of HIV 
infection 


No reliable global 
incidence estimates; most 
cases occur in HIV-positive 
individuals 


Little data for neonatal sepsis globally, especially 
among those more than 32 weeks gestation or more 
than 1,500g 

23% (95% Cl, 19-26%) of neonatal meningitis 
survivors (or 18,000 children; 95% Cl, 2,700— 
35,000) estimated to sustain moderate to severe 
neurodevelopmental impairment 

In sepsis or meningitis, the primary neurological 
sequelae are cerebral palsy, impairment to vision, 
hearing and motor function, and seizure disorders 


22.8% (IOR, 12.1-29.2%) have at least 1 
neurocognitive sequela at discharge, 19.9% (IQR, 
12.1-35.2%) have at least 1 sequela post-discharge; 
16.0% (IOR, 7.1-21.2%) have at least 1 major sequela 
at discharge, 12.8% (IOR, 7.1-21.1%) have at least 1 
major sequela post discharge 

Neurological sequelae include motor deficits, hearing 
loss and visual disturbances 

Risk of major sequelae is higher in Africa (25.1%) and 
southeast Asia (21.6%) compared with Europe (9.4%) 


Neurological sequelae in 53.9% of child survivors 
(95% Cl, 42.6-64.9) 

Gross and fine motor impairment in children 

Motor deficits, optic atrophy, ophthalmoplegia, and 
hearing impairment in adults and older children 


Meningitis, cerebrovascular infarction, and paresis, 
tabes dorsalis (ataxia, paraesthesia and bladder 
dysfunction) 


Limited studies reporting cognitive 
impairment; developmental delay 
or learning difficulties are frequent 
in sepsis (30.0%; IOR, 26.4-44.4%) 
and meningitis (33.3%; |OR, 
26.7-36.8%) 


In children, cognitive impairment 
including low IQ, academic 
limitations, and impared executive 
function and in adults, cognitive 
impairment with slower cognitive 
speed seen 


Cognitive impairment in all areas 
tested, and poor scholastic progress 


Impaired memory, disorientation 
and dementia 


No data 


Behavioural changes 
and emotional 
disturbance including 
ADHD and learning 
difficulties 


Emotional disturbance 


Dementia, depression, 
delirium, mania and 
psychosis 


Neurocysticercosis 


Malaria 


STH infection 


Schistosomiasis 


Global, 
greatest 
burden in 
pig-raising 
areas 

with poor 
sanitation 


Sub- 
Saharan 
Africa, Latin 
America, 
Asia and 
Oceania 


Global, 
greatest 
burden in 
sub-Saharan 
Africa and 
Southeast 
Asia 


Global, 
greatest in 
sub-Saharan 
Africa and 
Southeast 
Asia 


2010 prevalence estimate: 
1.4 million (95% Cl, 1.3-1.6 
million) (epilepsy only) 


Annual incidence 
estimate: 216 million 


Estimated 2010 
prevalence: hookworm 
infected 439 million (95% 
Cl, 406-480), Ascaris 
lumbricoides infected 

819 million (95% Cl, 
772-892) and Trichuris 
trichiura infected 465 
million (95% Cl, 430-508). 


Estimated 2010 
prevalence: 252 million 
infected 


Among people with symptomatic neurocysticercosis 
diagnosed with brain imaging: seizures and epilepsy 
(78.8%; 95% Cl, 65.1-89.7%), headaches (37.9%; 

95% Cl, 23.3-53.7%), focal deficits (16.0%; 95% Cl, 
9.7-23.6%) and symptoms associated with increased 
intracranial pressure (11.7%; 95% Cl, 6.0-18.9%) 


Cerebral malaria: 5-28% of children have 
neurological deficits on discharge, including epilepsy, 
acute hemiparesis, hypertonia, cortical blindness 

and ataxia 

By 6-month follow-up the percentage of children 
with deficits has decreased to 0-4.4%, primarily in the 
areas of gross motor and fine motor skills 
Uncomplicated malaria: motor skills 


Not described 


Acute schistosomal encephalopathy: headache, 
confusion, seizure, loss of consciousness, focal 
deficits, visual impairment and ataxia 

Cerebral schistosomiasis: headaches, motor deficits, 
visual abnormalities, seizures, altered mental status, 
vertigo, sensory impairment, speech disturbances 
and ataxia 

Spinal cord schistosomiasis: lower limb weakness, 
bladder dysfunction, lower limb paraesthesia, 
hypoaesthesia/anaesthesia, deep tendon reflex 
abnormalities, constipation and impotence in 80% 
of cases 


Case reports of cognitive decline 
Cognitive symptoms of 
neurocysticercosis with active cysts: 
affects naming, verbal fluency and 
non-verbal memory 


Cerebral malaria affects general 
cognition, attention, working 
memory, visual spatial skills, 
somatosensory discrimination, 
speech and language, and receptive 
and expressive language 

Thirteen IQ point difference from 
non-affected children 1 year 

after episode, and around 26% of 
children have impairment 2 years 
after 

Severe malaria with neurological 
involvement affects executive 
function 

Severe malarial anaemia affects 
overall cognition estimated to 

lead to the equivalent of an 111Q 
point difference from non-affected 
community children 

Malaria with multiple seizures leads 
to speech and language problems 
Malaria with impaired 
consciousness leads to attention/ 
language problems 
Uncomplicated malaria leads to 
language problems 

Asymptomatic malaria leads 

to problems with fine motor 
coordination, attention and 
abstract reasoning 


School-aged children: T. trichiura 
and A. lumbricoides affected 
learning and verbal memory in 
one study 

In another, general STH infection 
reduced memory capacity, rate of 
processing and attention 


For Schistosoma japonicum 
infection in children (not 
neurological infection): verbal 
memory and verbal fluency 
affected 


Neurocysticercosis 
with active cysts: 
dementia (12.5%) and 
cognitive impairment, 
but not dementia 
(27.5%); psychosis 


Cerebral malaria: 
internalizing and 
externalizing 
problems, ADHD, 
disruptive behaviour, 
psychosis and 
depression 


Children under 5 years 
of age: social and 
emotional 
disturbances 
(combined with 
anaemia) 


No data 


ADHD, attention deficit disorder; Cl, confidence interval; CNS, central nervous system; HSV, herpes simplex virus; IOR, interquartile range; LMICs, low- and middle-income countries; PNS, peripheral 
nervous system; STH, soil-transmitted helminths; VZV, varicella-zoster virus. Prevalence estimates are typically used (for example, STH infections and schistosomiasis) because accurate incidence 
numbers for these infections are difficult to obtain. 
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Table 2 | Potential areas for intervention in infectious diseases that affect the nervous system. 


Disease Vaccine available 


Control of zoonotic reservoirs 


Control of vector Treatment 


populations 


Dengue New dengue vaccines being tested in large field trials NA Yes None available 
Chikungunya No NA Yes None available 
Japanese encephalitis Yes No Yes None available 
Rabies Yes Yes NA None available 
HSV encephalitis No NA NA Yes 
VZV Yes NA NA Yes 
Congenital cytomegalovirus No NA NA Yes 
HIV-related 
HIV No NA NA Yes 
Cryptococcal meningitis No NA NA Yes 
Toxoplasma encephalitis No Yes NA Yes 
Neonatal sepsis and meningitis | No NA NA Yes 
Bacterial meningitis Yes, for Haemophilus influenzae type b, and pneumococcal NA NA Yes 

(multiple serotypes) and meningococcal (A, C, Y and W135) 

meningitis 
Tuberculous meningitis Partial protection provided by BCG vaccination Infrequent (cases due to Mycobacterium bovid and NA Yes 

Mycobacterium caprae, both of which are present in cattle, 
reported) 
Neurosyphilis No NA NA Yes 
ES on Sn ee as 

Neurocysticercosis No Porcine vaccine trials underway, pig treatment available NA Yes 
Malaria RTS,S vaccine had efficacy in phase III studies and other NA except for Plasmodium knowlesi Yes Yes 

vaccines are being developed 
STH No. Hookworm vaccine is in phase | trials, but is linked to NA except for Toxocara canis NA Yes 

adverse events 
Schistosomiasis No, but phase | vaccine trials are ongoing Bovine vaccine trials underway for Schistosoma japonicum Yes Yes 


BCG, Bacillus Calmette—Guérin; HSV, herpes simplex virus; NA, not applicable; STH, soil-transmitted helminth; VZV, varicella-zoster virus. 


Soil-transmitted helminths (STH) affect millions, most frequently 
children. Determining the part played by STH in cognitive impairment 
of children is complicated because STH infection is associated with 
many confounders, but data that support a role for STH in neurobe- 
havioural outcomes include a study showing that infants and children 
under 5 years of age with anaemia and STH infection show disturbed 
social and emotional behaviour. Another study showed that treating 
school-aged children with antiparasitic drugs and iron supplementa- 
tion improved attention, memory and processing speed*?. 


GLOBAL RESEARCH PRIORITIES 

Prevention of infections that affect the nervous system is the highest 
research priority, as complete prevention of infection removes all risk 
of nervous system sequelae. However, treatment of nervous system se- 
quelae and rehabilitation of individuals with nervous system morbidity 
are also important for the millions who currently live with the nervous 
system effects of infections. Prevention and treatment of infections 
that affect the nervous system requires the identification of the path- 
ogens responsible, the pathogen reservoirs and the potential points at 
which the pathogen life cycle can be interrupted. Table 1 lists specific 
pathogens, their known nervous system manifestations, and current 
knowledge gaps regarding incidence and long-term sequelae for each 
pathogen. Table 2 outlines whether specific interventions (vaccines, 
control of zoonotic reservoirs or vector populations, and treatment) 
are available for each pathogen. Table 3 provides a summary of global 
research priorities for infections that affect the nervous system. These 
research priorities are discussed in more detail below. 


Diagnosis 
Improved diagnosis lies at the heart of all research priorities for infec- 


tions that affect the nervous system because all other research areas 
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depend on accurate infection diagnosis. Improved diagnosis requires 
better tests to detect infection, better clinical diagnostic algorithms to 
detect infection and better tools to assess the nervous system effects of 
infection, including cognitive and mental health sequelae. 

Affordable, easy-to-use, rapid diagnostic assays — preferably point- 
of-care — that can identify infections affecting the nervous system are 
a high priority. This includes diagnostic tests for infections that directly 
infect nerve cells and those that do not (for example, malaria and STH). 
For infections that affect the CNS, field diagnoses are needed to identify 
when the infection has entered the CNS. Serological assays are availa- 
ble to detect schistosomiasis** and cysticercosis, but these tests are not 
specific for CNS infection, and the blood-brain barrier may prevent the 
detection of antigens in serum*°. In the case of bacterial, fungal or viral 
CNS infections, although lumbar puncture to obtain cerebrospinal fluid 
is aroutine procedure at many centres in LMICs, most lack the capacity 
for standard bacterial, fungal or viral cultures, let alone more sophisti- 
cated testing such as PCR, which is essential for detecting many viral 
infections. Even in high-income countries with advanced molecular di- 
agnostics, an aetiological agent is identified in less than half of individu- 
als with encephalitis. Because many cases of idiopathic encephalitis are 
probably caused by viruses still to be characterized, the development 
of metagenomic and high-throughput screening techniques for viral 
detection is a research priority, with the goal of eventually developing 
low-cost diagnostic point-of-care assays for the pathogens identified. 
For certain CNS infections, notably neurocysticercosis, neuroschistoso- 
miasis, CNS tuberculosis and CNS toxoplasma infection, neuroimaging 
with CT scans or MRI is needed to make a diagnosis. Although availa- 
bility of neuroimaging is becoming more widespread, many facilities in 
LMICs still lack these costly imaging modalities, underscoring the need 
for research on simple, accurate, low-cost, point-of-care diagnostic 
tests for detecting infections that affect the nervous system. 
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One example of how improved diagnostic tools can have an impact 
is astudy* from India in which a simple diagnostic algorithm and basic 
treatment for neonatal sepsis, all performed by village health workers, 
led to a 63% reduction in neonatal mortality among preterm infants". 
Simple algorithms for other infections that affect the nervous system, 
coupled with the ability to provide effective therapy following diag- 
nosis, or appropriate referral for screening algorithms, have the po- 
tential to substantially reduce morbidity and possibly mortality from 
these infections. Improved cross-cultural measurements of neurode- 
velopment and mental health are a key research priority, and reviewed 
in the article in this collection on child neurodevelopment (see page 
S155). 


Epidemiology and primary prevention 

The lack of affordable, non-invasive, rapid diagnostics for infection 
and nervous system effects of infection limits our ability to quantify 
the burden of infection-related nervous system disability (Table 1). 
Well-designed studies of disease epidemiology are also required for the 
accurate measurement of disease incidence, and of the type and dura- 
tion of nervous system sequelae of infection. 

A challenge to the estimation of infection-related nervous system 
disease is that the symptoms these infections result in, such as epilep- 
sy, hemiparesis or cognitive impairment, are included as ‘chronic dis- 
eases’ in global burden estimates. Careful epidemiological assessment 
could lead to the more accurate attribution of a portion of ‘chronic 
disease’ to its infectious component. For example, the Global Burden of 
Disease Study 2010 (ref. 42) attributed some of the disability-adjusted 
life years of epilepsy to neurocysticercosis. This infection is also asso- 
ciated with stroke**, which was ranked third in terms of disability-ad- 
justed life years in 2010 (ref. 42), but none of this burden was attrib- 
uted to neurocysticercosis owing to lack of data. Thus the true burden 
of nervous system disease owing to neurocysticercosis was probably 
significantly underestimated. 

The most cost-effective method of preventing infection is immu- 
nization, discussed in the section on vaccines below. For infections 
for which there is no immunization, or for which immunization is 
not highly successful, research is required on sustainable preventive 
methods. For vector-borne illness, for example, insecticide-based in- 
terventions such as insecticide-treated bed nets have reduced malaria 
incidence and mortality in many areas“. But increasing pyrethroid re- 
sistance* highlights the need for ongoing research even for interven- 
tions with documented past success. 


Pathogenesis 

Disease pathogenesis may be the most neglected research focus of in- 
fection-related nervous system disease in LMICs. Although some stud- 
ies on the pathogenesis of infection-related nervous system disease in 
individuals in high-income countries are available**’, far fewer studies 
of pathogenesis have been conducted in individuals from LMICs. Even 
in high-income countries, studies of infection pathogenesis often use 
animal models, which may incompletely recapitulate the host response 
in humans. The host immune response probably contributes to both 
defence against invading pathogens and subsequent damage to the 
nervous system**, but the type and role of specific cells in the immune 
response at different infection stages are poorly described. Similarly, 
it is often unclear which antigens or components of the infecting or- 
ganism confer neurovirulence. The roles of innate immunity, the mi- 
crobiome, and co-infection with endemic pathogens, including HIV, in 
contributing to infection-related nervous system disease are also poor- 
ly understood (Box 1). Without an understanding of the pathogenesis 
of infection-related nervous system disease, it is difficult to rationally 
plan for adjunctive interventions to prevent or reduce nervous system 
injury. Although adjunctive interventions have been elusive, those 
proven successful (for example, steroid treatment in tuberculous men- 
ingitis!?) have made a major difference in improving neurocognitive and 
behavioural outcomes. An understanding of the development and types 


527|7578|19 November 2015 


JOHN ETAL. | INFECTIONS AFFECTING THE NERVOUS SYSTEM 


Table 3 | Global research for infections that affect the nervous system. 


Priority area Research needed 


¢ Rapid, accurate, low-cost, point-of-care diagnostic tests for infections that 
affect the nervous system 

¢ Clinical diagnostic algorithms for infections that affect the nervous system 

Improved testing for detection of infection-related nervous-system 

disabilities 


Diagnosis 


Accurate incidence and prevalence estimates of common infections that 
affect the nervous system 

¢ Accurate identification and frequency estimates of nervous-system 
manifestations and sequelae 

Identification of potentially modifiable risk factors specific to infections 
that affect the nervous system 


Pathogenesis ¢ Identification of host response pathways that lead to nervous-system 
deficits or to clinical immunity 
¢ Identification of pathogen factors that lead to nervous-system deficits or 
to clinical immunity 
e Assessment of risks and interactions of co-infections and co-morbidity 


¢ Develop safe and effective vaccines based on immunology, epidemiology 
and pathogenesis studies 

¢ Phase | and II trials 

¢ Phase Ill trials 


Treatment e Effective adjunctive treatment to prevent or decrease nervous-system 
deficits or disabilities 
¢ Low cost, low toxicity antimicrobials that work against drug-resistant 
pathogens 
¢ Multi-site, large clinical trials that provide definitive answers on 
interventions 
Effective or improved primary treatment of infection 


Effective and feasible physical, occupational and cognitive rehabilitation 
programmes 


Operations and ¢ Optimal methods to implement or operationalize interventions with 
implementation known efficacy 


of protective immune responses to antigens or antigenic variants of a 
pathogen is also fundamental to the development of vaccines, which 
are, in most cases, the most cost-effective method of preventing infec- 
tion. 


Vaccine development 
Vaccines are available to prevent the neurological complications of 
measles, mumps, rubella, poliomyelitis and varicella virus as well as H. 
influenzae type b, S. pneumoniae and N. meningiditis. Effective vaccines 
are also available for rabies and Japanese encephalitis. Together, these 
vaccines have saved millions of lives as well as prevented long-term 
nervous system complications in millions of children and adults. 
Research priorities for vaccine development include the utiliza- 
tion of disease immunology, epidemiology and pathogenesis studies 
to develop safe, effective vaccines, and the performance of phase I, II 
and II trials to determine vaccine efficacy and safety in humans. In 
P. falciparum malaria, for example, knowledge of antibody and T-cell 
immune responses to circumsporozoite protein (CSP)*°*! led to phase 
I and II trials of the CSP-based RTS,S vaccine. These successful trials 
led to the recently completed phase III trials of RTS,S**. This constituted 
a major advance in the vaccine field because they established RTS,S 
as the first successful vaccine in humans against a parasite. However, 
the relatively modest efficacy (30-50%) of RTS,S was not surprising in 
light of the known complexity of the human immune response to P fal- 
ciparum in endemic populations. Hence, work continues on the devel- 
opment of more effective vaccines. Understanding the human immune 
response to P falciparum infection will be key to the development of 
vaccines with improved efficacy and safety. 


Treatment 

Treatment with antimicrobials is designed to clear infection or reduce 
infectious load, decrease disease severity, and ideally to provide a de- 
gree of secondary prevention against the nervous system effects of 
the infection. For viral infections, with the exception of HSV and HIV, 
there is often no specific treatment. Even for HSV encephalitis, stand- 
ard treatment (intravenous acyclovir) is unavailable in many parts of 
the world. Cost effectiveness and stakeholder analyses could be use- 
ful in influencing policymakers to increase availability of antiviral 
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BOX 1 | EMERGING RESEARCH AREAS 


Diarrhoeal disease 

Diarrhoea is a leading cause of mortality in children living in 

low- and middle-income countries (LMICs)”. Most of these deaths 
occur in Africa, Southeast Asia and in eastern Mediterranean 
countries. In 2010, it is estimated that there were 1.731 billion 
episodes of diarrhoea worldwide of which 36 million progressed 
to severe diarrhoea and 700,000 episodes resulted in death™. Its 
occurrence in the first 2 years of life is associated with an 8 cm 
decrease in height and a 10 point drop in IQ by the time children 
are around 7 to 9 years old®. The mechanism by which diarrhoea 
affects cognition is not clear, but it could be through the effect of 
diarrhoea on stunting, which in turn predicts future cognition®. 
However, during diarrhoea-free periods in the first 2 years of life, 
children experience catch-up growth and may return to their 
original growth trajectories’’. This highlights the importance 

of effective interventions for diarrhoea to sustain the child's 
developmental potential. The high frequency of diarrhoea episodes 
during this critical developmental stage and the large number 

of cases makes diarrhoea a major public health concern for child 
development. 


The microbiome 

Although it has been clearly demonstrated that pathogenic 
microbes can cause brain disorders, there is increasing evidence 
that the microbial population harboured in the human body, termed 
the human microbiome, can as a whole influence brain activity™. 
Recent clinical studies among healthy subjects suggest that 
treatment with a probiotic is associated with reduced symptoms 

of stress and depression’. There is also evidence of associations 
between the microbiome and neurological diseases, such as 
multiple sclerosis and autism spectrum disorder (ASD)’”°. In a recent 
study using a mouse model of ASD, treatment with probiotics 
alleviated some behavioural symptoms of the disorder”. The 
composition of the human microbiome shows marked differences 
between countries” and comparative research conducted in high- 
income countries and LMICs could lead to a better understanding 
of the part played by the human microbiome in brain disorders, and 
possible treatment of these disorders with factors that favourably 
alter the microbiome. 


treatment. For HIV-associated opportunistic infections such as cryp- 
tococcal meningitis, development of therapies that do not rely on in- 
travenous administration is a priority because capacity for intravenous 
medication is limited in many LMICs, particularly in rural areas. There 
is also a need for improved access to new assays that detect antiretro- 
viral therapy resistance (for example, the oligonucleotide ligase assay), 
as these can guide cART treatment. For many parasitic infections, an- 
tiparasitic medications are available, but their efficacy in reducing the 
neurological or cognitive sequelae remains uncertain. For this reason, 
as noted in the pathogenesis section, development of adjunctive thera- 
pies that target prevention or reduction of nervous-system injury is an 
important research priority. 

Development of low-cost, low-toxicity antimicrobials that work 
against drug-resistant pathogens is a research priority for several in- 
fections, including tuberculous meningitis and neonatal sepsis caused 
by multiresistant Gram-negative infections. Qualitative studies to bet- 
ter understand medical non-compliance, and to develop innovative 
solutions to reduce non-compliance through newer technologies, such 
as mobile devices that support medical and public-health practice, are 
also needed. 

Finally, there is a need for multicentre clinical research trials with 
sufficient sample sizes to provide definitive answers on the efficacy of 


S184 


specific interventions. For example, where smaller trials had failed, the 
Cryptococcal Optimal Anti-Retroviral Timing (COAT) trial conducted 
in Uganda and South Africa successfully determined that deferred ini- 
tiation of anti-retroviral therapy in individuals with HIV until 5 weeks 
after treatment of cryptococcal meningitis improved survival**. This 
study finding is likely to change international guidelines. 


Physical, occupational and cognitive rehabilitation 
Whereas physical, occupational and cognitive rehabilitation for indi- 
viduals with sequelae of CNS infections are routine in developed coun- 
tries, such interventions are limited in LMICs owing to a lack of trained 
personnel and prohibitive costs. Thus, research on how to build capac- 
ity for rehabilitation and how to support it in the context of LMICs is 
required. Rehabilitation for cognitive impairment can be successfully 
implemented in the community using locally available resources or 
in a tertiary institution using advanced methods. In the community, 
home stimulation, parenting education and support, and provision of 
financial support or nutritional support for children enrolled in early 
child development centres have shown some benefit in improving chil- 
dren’s cognition®**. Interventions that target both the carer and the 
child are more effective than those that include either one®®. In tertiary 
centres, computer-based cognitive training programmes have proven 
effective in improving cognition in African children surviving CNS in- 
fections*”**. These cognitive training programmes can target specific 
disabilities; however, they are in their early stages and more research 
is required to determine the most cost-effective implementable and 
sustainable programmes for LMICs. 


Operations and implementation research 
Operationalization and implementation of known effective interven- 
tions is another research priority area. Vaccines for S. pneumoniae and 
N. meningiditis are highly effective and have been implemented in some 
LMICs, but these vaccine-preventable infections continue to affect 
more than 1 million people each year. Thus, in addition to increased 
investment in the basic science of vaccines, a major research priority 
is the assessment of methods to support and implement widespread 
vaccination in LMICs. 

Another example is implementation research related to effective 
prenatal, perinatal and neonatal care, which would decrease neona- 
tal sepsis. Assessment of effective methods for non-physician health 
workers to provide medical and preventive care to mothers and new- 
borns in LMICs are needed®. The recent increase in neonatal infection 
with multi-drug-resistant Gram-negative organisms in LMICs**?5 
makes prevention of neonatal sepsis an even greater research priority. 
Rapid diagnostics and treatment interventions that are successful in 
field trials also require implementation research for successful wide- 
scale adoption and appropriate use. 


Capacity building 

Capacity building within LMICs is key to the successful reduction or 
elimination of nervous system complications of infection. The Review 
on research capacity building in this collection addresses this topic in 
depth (see page S207). Research priorities specifically in the area of in- 
fection-related nervous system morbidity include an increase in the 
number of clinicians and researchers in infectious disease, neurosci- 
ence, neurology and mental health®, and the dedication of a portion 
of LMICs health budgets to infectious disease and mental and neuro- 
logical health . Research training grants and collaborative research 
between partners in LMICs and high-income countries specifically in 
the area of infection-related neurocognitive impairment can also help 
to build human resource capacity. Physical infrastructure is another 
priority; without space for laboratories, diagnostic equipment or re- 
search clinics, surveillance cannot be performed and information to 
guide interventions to reduce the burden of these infections cannot be 
generated. With improved human resource capacity and infrastruc- 
ture, the development of effective screening instruments, prevention 
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and treatment, and increased government support to address these in- 
fections are more likely to be achieved. Box 2 provides examples of ca- 
pacity building in Peru and Uganda in the area of infections that affect 
the nervous system. This was enabled by Fogarty International Center 
grants for collaborative US-LMICs partnerships in these countries. 


CONCLUSIONS 

In the past two decades, an increasing body of evidence implicates in- 
fection as a cause of substantial nervous system morbidity in high-in- 
come countries and LMICs. The burden is particularly high in LMICs, 
where infections such as HIV (and associated opportunistic infections), 
HSV, dengue, bacterial and tuberculous meningitis, malaria, neuro- 
cysticercosis, STH, schistosomiasis and other infections affect billions of 
people annually, and cause substantial neurological morbidity in those 
individuals. The involvement of the nervous system in infections is of- 
ten first recognized in LMICs where the prevalence of these infections 
is higher and where new infections often emerge. Research conducted 
in these countries can contribute to prevention and cure before these 
infections become globalized. Research is needed in multiple areas to 
determine the true burden of disease and to develop point-of-care diag- 
nostic assays for diagnosing infection, vaccines and other interventions 
for preventing infections; to improve our understanding of the patho- 
genesis of nervous system disease in these infections; to develop better 
tools for the assessment of neurological, cognitive and mental health 
impairment; to develop more effective treatments and preventions for 
nervous system sequelae, to improve the implementation of successful 
interventions, and to improve rehabilitation for those with long-term 
neurocognitive or mental health disabilities. Good research studies in 
these areas, accompanied by equally strong efforts to implement prom- 
ising technologies and therapies, could substantially decrease the mor- 
bidity and mortality of infections affecting the nervous system in LMICs. 
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BOX 2 | CAPACITY BUILDING IN UGANDA AND PERU 
Uganda 


The Severe Malaria Research Centre in Uganda is an example of 
how collaborations between local and foreign scientists, with 
support from the Fogarty International Center, has built a hub for 
research excellence. Through a National Institutes of Health (NIH) 
R21 exploratory research grant in 2004, local scientists developed 
research capacity by involvement in research studies, and grant and 
manuscript writing. This has since led to further NIH grants (four 
RO1 grants, a U01 grant, a D43 grant and two R34 grants) as well as 
multiple grants from other agencies. Ugandan and US faculty are 
principal investigators on these grants. A book on neuropsychology 
of African children and more than 30 research papers have been 
published from these projects so far. Ugandan scientists and 
physicians have obtained faculty positions in Makerere University, 
Kampala. The infrastructure that has been built includes high-speed 
Internet connectivity for research offices and faculty members, 

a laboratory, a data room and a grants management office. With 
the infrastructure in place, this centre is now providing training 

for many Ugandan and US students and researchers at all levels of 
training, from undergraduates to post-doctoral fellows and faculty. 


Peru 

Through the Fogarty International Center NIH R21 and R01 grants 
in Peru, a network of neurologists who are engaged in brain- 
disorder research has been developed throughout the country. 
Trainee alumni of this network now serve as collaborators on 
emerging research and training activities in both infectious and 
chronic diseases of the nervous system (such as cerebrovascular 
diseases). The 2 sites in Lima have been scaled up to 12 hospitals 
and 2 universities in 3 Peruvian regions. Capacity building of 
individuals was provided through workshops, hybrid virtual/ 
in-person certificate courses, as well as medium- and long-term 
training in Seattle and Peru. An initial mentor-training workshop 
developed into a growing network of mentors, three of whom have 
been awarded Clayton—Dedonder Mentorship Fellowships by the 
Fogarty International Center and have started institutionalization 
of mentor training programmes at three institutions in Lima. 
Those who received the R21 and RO1 grants in the past are now 
experienced researchers who are leading the development of 
research in new areas, such as neurogenetics research and the 
development of a cerebrovascular diseases research training 
programme. Research supported by these awards resulted in 26 
peer-reviewed publications and book chapters. Programme alumni 
are becoming leaders in brain research and are mentoring the 
newest wave of young neurologists and neuroscientists. 
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A global perspective on the influence of 
environmental exposures on the nervous system 
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Economic transitions in the era of globalization warrant a fresh look at the neurological risks associated with environmental 
change. These are driven by industrial expansion, transfer and mobility of goods, climate change and population growth. In these 
contexts, risk of infectious and non-infectious diseases are shared across geographical boundaries. In low- and middle-income 
countries, the risk of environmentally mediated brain disease is augmented several fold by lack of infrastructure, poor health and 
safety regulations, and limited measures for environmental protection. Neurological disorders may occur as a result of direct 
exposure to chemical and/or non-chemical stressors, including but not limited to, ultrafine particulate matters. Individual sus- 
ceptibilities to exposure-related diseases are modified by genetic, epigenetic and metagenomic factors. The existence of several 
uniquely exposed populations, including those in the areas surrounding the Niger Delta or north western Amazon oil operations; 
those working in poorly regulated environments, such as artisanal mining industries; or those, mostly in sub-Saharan Africa, re- 
lying on cassava as a staple food, offers invaluable opportunities to advance the current understanding of brain responses to en- 
vironmental challenges. Increased awareness of the brain disorders that are prevalent in low- and middle-income countries and 
investments in capacity for further environmental health-related research are positive steps towards improving human health. 
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eports from the World Health Organization (WHO) indicate that 

the global burden of disease is determined by patterns of disease 

and disability in low- and middle-income countries (LMICs), 
which, predictably, have their own environmental signatures (http:// 
www.who.int/healthinfo/global_burden_disease/about/en/). How- 
ever, the effect of such signatures on both brain health and region or 
global disability-adjusted life years (DALYs) remains unknown and 
needs to be added to the agenda of global environmental health re- 
search. As for high-income countries, environmental health research 
programmes in LMICs must primarily focus on elucidating the entire 
range and source of exposures to define the human ‘exposome’ (the 
measure of all the exposures of an individual in their lifetime and how 
these exposures relate to health) relevant to brain health in LMICs. The 
research agenda should also include mechanistic and translational re- 
search, as well as capacity building to foster a new generation of envi- 
ronmental health scientists. 


SCOPE 

In this Review, we focus on environmental risk factors for brain diseases 
and conditions in LMICs (http://data-worldbank.org/about/country- 
and-lending-groups). An iterative search of the literature was con- 
ducted using PubMed to retrieve information related to environmental 
determinants and mechanisms of brain disease in LMICs. Additional 
opinion was obtained from interviews with leading environmental 


scientists and neuroscientists, as well as programme officers at the US 
National Institutes of Health and US National Institute of Environmen- 
tal Health Sciences (NIEHS), and Fogarty International Center. This 
Review integrates the goals and approaches to environmental health 
research as per the NIEHS 2012-2017 strategic plan (https://www. 
niehs.nih.gov/about/strategicplan/). 


ENVIRONMENTAL EXPOSURE AND BRAIN HEALTH 

LMICs are home to around 80-85% of the world’s population!. Of these 
5.8 billion people’, 1 billion remain in extreme poverty, living below the 
US$1.25 per day poverty line*. Around 3 billion people do not have piped 
drinking water in their home and 173 million people rely on the direct 
use of surface water. Without proper sanitation, about one billion con- 
tinue to defecate in gutters, in the open bush or in open water bodies’. 
Wildfires and deforestation are commonplace and drought and floods, 
possibly due to climate change, degrade the existing farming systems 
and create food insecurity*>’. Armed conflicts and population displace- 
ments impose a toll on human life*®. Industrial expansion coexists with 
an unprecedented rise in artisanal mining and unprotected labour’. 
In some instances, normal urbanization operations, such as road con- 
struction and quarantines (for example during Ebola outbreaks in the 
Democratic Republic of the Congo) have created conditions that exac- 
erbated the risk of environmental exposure and brain disease". Flawed 
regulations compounded by a lack of infrastructure set the stage for 
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Table 1 | Heavy metals and exposure-related outcomes 


Heavy metal 


Lead’*78 


Arsenic®?** 


Cobalt***? 


Source of exposure 


Lead-contaminated dust, lead-based 
paint, soil, drinking water, air, leaded 
gasoline, toys and lead-contaminated 
sweets 


Mining industry, power plants, 
crematoria, charcoal industry, and 
contaminated food (mostly sea food) 
and water 


Contaminated food and drinking 
water, air and arsenic-based 
treatments 


Contaminated drinking water and 
food, uncoated copper cookware and 
infant formula containing copper 


Contaminated drinking water and 
food, inhalation of dust containing 


lity window 


Lifelong 


From neural development to 
neurulation, and adolescence 


5-15 years 


Children 


Those over 65 


Prenatal, young children and the 
elderly 


Neurological outcomes 


Visual and verbal memory decline, 
intellectual deficits, decline in 
executive functioning (fine motor 
function, hand-eye coordination and 
reaction time) and hyperactivity in 
children 


Ataxia in adults and language, 
attention, and visuospatial 
performance deficits in children 


Impaired selective and focused 
attention and long-term memory 
in children, and sensorimotor 
polyneuropathy 


Alzheimer’s disease, OCD, ADHD, 
antisocial behaviour and anxiety in 
children 


Optic, auditory and peripheral 
neuropathy, motor deficits and verbal 


Proposed mec 


Disruption of neurotransmitter 
release and function, and prenatal 
disruption of neuronal migration and 
differentiation; aggravating factors 
include poor nutrition (deficiency in 
iron, zinc and calcium) and younger 
age 


Oxidative stress or impairment of 
intracellular calcium and glutamate 
homeostasis 


Oxidative stress or disruption of 
metabolism of neurotransmitters 


Oxidative stress, microglia 
cell activation or promotion of 
a-synuclein and fibril formation 


Alteration of mitochondrial oxidative 
phosphorylation or depletion of 


cobalt particles in various industries 


Fumes or dust, cigarette smoke, and 


contaminated food and water elderly 


Airborne as fumes, aerosols or 
suspended particulate matter and 
contaminated water 


Manganese’*”® 


Contaminated air, water and food, 
cosmetics (such as antiperspirants), 
metal industries and pharmaceuticals 


Lifelong 


ADHD, attention-deficit hyperactivity disorder; OCD, obsessive compulsive disorder. 


environmental degradation and pollution to pose serious threats — of a 
chemical or non-chemical nature — to human health. The degradation 
of local ecosystems leads to the creation of ‘microenvironments’ that 
have a high risk of harmful exposures, often resulting in unique chal- 
lenges and increased risk of human disease (Fig. 1). 


HIGH-RISK POPULATIONS AND MICROENVIRONMENTS 
Risk of exposure-related brain disease is determined by age, gender 
and microenvironments created by natural disasters in which eco- 
nomic, social and cultural determinants of health often have impor- 
tant roles. One example of a profit-mediated environmental risk is that 
caused by the oil industry through accidental spills or mismanagement 
of oil operations. For instance, crude oil operations have polluted large 
areas of rainforests, including streams and rivers in Ecuador, Peru 
and Colombia". The population of Nigeria has faced similar challenges 
owing to reoccurring oil spills as a result of ageing, ill-maintained or 
sabotaged pipelines in the Niger Delta. The impact of such man-made 
and preventable natural disasters on human health has yet to be deter- 
mined. Effects on human health will depend on the type and composi- 
tion of the spilled oils, which often contain a mixture of polycyclic hy- 
drocarbons that are known to be toxic to the nervous system". Oil spills 
arise owing to reasons, such as a lack of vigilance, neglect of necessary 
health and safety checks, or sometimes even promotion of commercial 
interests at the expense of communities. Symptoms of acute exposure 
to raw oil include consistent episodes of headache, nausea, dizziness 
and fatigue. Chronic effects include psychological disorders, endocrine 
abnormalities and genotoxic effects”. 

Microenvironments in which the population has a higher suscep- 
tibility to exposure-related diseases have also been created by extreme 
poverty and natural disasters, including drought and flooding that can 
degrade soils, plants and farming operations. The burden of conven- 
tional neurodevelopmental stressors (for example, lead) on children 
is exacerbated by unique environmental challenges, including mal- 
nutrition and enteric infections’ '° and, possibly, a diet of neurotoxi- 
cant-containing plants such as cassava (Manihot esculenta; also known 
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Prenatal, young children and the 


Childhood and the elderly 


memory loss neurotransmitters 


Antisocial behaviour and attention 
impairment in children, parkinsonism 
and peripheral neuropathy 


Oxidative damage and 
neurotransmitter disruption 


Reduced IQ, impaired verbal learning 
and working and immediate memory 
in children, and Parkinson-like 
symptoms 


Disruption of mitochondrial 
respiratory chain reaction; 
aggravating factors include iron 
deficiency and impaired biliary 
excretion (liver injury or disease) 


Alzheimer’s pathology in the form of 
neurofibrillary tangles 


Disruption of mitochondrial 
respiratory chain reaction or 
inflammation; zinc deficiency 
acknowledged as an aggravating 
factor 


as tapioca), the grass pea Lathyrus sativus or the seeds from the cycad 
plants, which are all known to be associated with a high burden of neu- 
rodisabilities at a population level!” **. Populations with unique expo- 
sures and risks include those living in the tropical cassava belt of Ango- 
la, the Central African Republic, Cameroon, Congo, Tanzania, Uganda, 
Nigeria and Mozambique* °°; those reliant on L. sativus as a staple food 
in Ethiopia, Eritrea, India and Bangladesh???!’*°; and the people of the 
Pacific island Guam or the Japanese Kii Peninsula where the rates of 
environmentally linked syndromes such as amyotrophic lateral sclero- 
sis-parkinsonism-dementia complex (ALS/PDC) have been declining 
for reasons that have yet to be uncovered****. 

The impact of early childhood diseases that lead to a vicious cycle of 
enteric infections and malnutrition has been underestimated and ne- 
glected, especially in areas that lack acceptable levels of hygiene and 
sanitation and that have reduced accessibility to vaccines and antimi- 
crobials. This has caused clinically silent, chronic-illness-related ef- 
fects, which jeopardize the child’s full cognitive development'*’. This 
vicious cycle establishes what is called environmental enteropathy, a 
mostly subclinical condition (even without diarrhoea) caused by var- 
ious degrees of intestinal barrier dysfunction, luminal-to-blood in- 
testinal bacterial translocation, low-grade local and systemic inflam- 
mation, and disrupted innate intestinal immune responses that may 
affect growth” and cognition” and possibly lead to neurodegeneration 
as well as liver, and metabolic diseases later in life***°. 

Adolescents in LMICs experience a higher burden of exposures (in 
contrast with those in high-income countries), primarily because of 
the childhood labour crisis. Although there are regulations and inter- 
national agreements restricting child labour, often there are exceptions 
for certain industries, notably the growing agricultural industry, one 
of the most hazardous industries worldwide*“'. In this context, ado- 
lescent workers are at risk of exposure to agrochemicals such as pesti- 
cides**3. Other work-related threats include exposure to organic sol- 
vents in work that involves painting and manufacture, to toxic metals 
and fine particulate matters in artisanal mining, and to heat and am- 
bient air pollution while working long hours and outside’. Exposure to 
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Table 2 | Organic compounds and exposure-related outcomes 
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Organic compound 


Bisphenol A®° 


Source of exposure 


Food from cans with linings that 
contain BPA, and contaminated food 
and water 


Food or drink that has been in 
contact with containers or products 
containing phthalates, and air and 
dust containing phthalates 


Contaminated food and water, 
polluted air and professional dermal 
contact 


Organophosphates®*° 


Contaminated food, drinking water 
and air 


Organobromide compounds Contaminated food, water and air 


(PBDEs)? 


Air and professional dermal contact 
and glue sniffing 


Oral ingestion 


Food-born neurotoxicants 


(cassava cyanogenic glucosides 
and BOAA in Lathyrus 
sativus)**** or contaminants 
(fungal toxins) 


lity window 


Prenatal and childhood 


Prenatal and childhood 


Lifelong 


Prenatal and lifelong 


Lifelong 


Adolescence and adulthood 


Lifelong 


Neurological outcomes 


Anxious behaviour, hyperactivity and 
depressive behaviour, and learning 
impairment in children 


Depressive and conduct-related 
behaviours (ODD, attention 
problems, rule-breaking and 
aggressive behaviour in children) 


Neurodevelopmental deficits, 
impaired attention and working 
memory, impaired speed and 
executive functions, and delayed 
peripheral polyneuropathy 


Impaired intellectual ability, ADHD- 
like behaviours and locomotor 
deficits 


IQ deficits, impaired attention, fine 
motor coordination and cognition 
functioning in children 


Headache, memory deficits, and 
central and peripheral neuropathy 


Spastic paraparesis, cognition deficits 
and possibly convulsive disorders 


Proposed mecha 


Unclear, but females seem to be more 
susceptible 


Oxidative stress 


Inhibition of acetyl-cholinesterase 


Disruption of neurotransmitter 
function, oxidative stress 

or derangement of calcium 
homeostasis; children seem to be 
more susceptible. 


Impairment of thyroid hormone 
homeostasis 


Protein adduction and/or oxidative 
misfolding or oxidative stress 


Oxidative stress, excitotoxicity and 
protein carbamylation for cassava 
cyanogens; children and females 
seem to be more susceptible; 
malnutrition is acknowledged as an 
aggravating factor 


ADHD, attention-deficit hyperactivity disorder; BOAA, beta-(N)-oxalyl-amino-t-alanine acid; BPA, bisphenol A; DDT/PCBs, dichlorodiphenyltrichloroethane/polychlorinated biphenols; ODD, oppositional 


defiant disorder. 


Table 3 | Complex exposures and neurological outcomes 


Exposure 


Source of exposure 


Susceptibility window 


Neurological outcomes 


Proposed mechanisms 


Charcoal/coal combustion, gas 
grilling, wood smoke, or coal mine 
dust or ash 


Fine and ultrafine particulate Air pollution from car or construction 
matters? equipment exhausts, wood burning, 


Coal/charcoal burning®®*” 


Lifelong 


Lifelong 


Lifelong 


Neurological signs of exposure to 
arsenic 


Learning disability and motor 
impairment 


Behavioural and decreased IQ, 
impaired fluid cognition, memory 


Oxidative stress 


Oxidative stress or neurotransmission 
disruption 


Oxidative stress, neurotransmission 
disruption or neuroinflammation 


heating oil or coal, forest fires, 
volcanic eruptions, tobacco smoke 
and cooking 


industrial solvents such as n-hexane, for example, may occur because 
of poor safety regulations or recreational glue sniffing. This may result 
in headache, acute encephalopathy or sensorimotor neuropathies that 
are reversible on cessation. 

Adults in LMICs may be at a particularly high risk of environmental 
exposure and related brain diseases compared with those in high-in- 
come countries. In general, they experience a higher burden of disease 
owing to a lifetime of cumulative exposures and co-morbidities that are 
highly prevalent in LMICs. The latter include malaria, nutritional de- 
ficiencies and neurotropic infections such as those caused by human 
T-cell lymphotropic viruses (HTLV). For example, it was reported that 
endemic foci of HTLV-I-associated myelopathy coexist with outbreaks 
of konzo (a spastic paraparesis linked to the toxicity of cassava cyano- 
gens) in some regions of the Congo*5“°. In these areas, women of child- 
bearing age are particularly susceptible to the toxicity of cassava cyano- 
gens for reasons that have not been elucidated, although they may be 
linked to hormonal influences and poor nutrition*’. 


PATHWAYS TO BRAIN DISEASE 

Exposure-related brain damage may result from chemical and/or 
non-chemical stressors. Damage to the nervous system often leads 
to a range of bilateral and symmetrical motor and/or sensory symp- 
toms. Behavioural problems, cognition deficits and psychiatric illness 
may also occur. Non-chemical stressors include, but are not limited to, 
psychological stress, heat, noise, fine and ultrafine particulate matter 
(FUPM), and waterborne, airborne or foodborne pathogens that may 
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and executive functions, and possibly 
autism 


occur under the conceptual framework shown in Fig. 1. Chemicals with 
neurotoxic potential that people are commonly exposed to are listed 
in Tables 1-3. Mixed exposure, for example chemical-covered FUPM 
from industrial emissions; co-exposure to chemical and non-chemi- 
cal stressors; and repeated and multiple exposure can occur, creating a 
complex human environmental exposome. 

Brain damage linked to chemical exposure may result from chem- 
icals interfering with neurotransmission through molecular mimicry 
or reacting with crucial biomolecules and causing incorrect function 
(for example, protein or DNA adduction and/or crosslinking). For 
both chemical and non-chemical exposures, the mechanisms of brain 
damage may include injury to the vascular system (for example, fine 
particulate matter induced vascular pathology), systemic dyshomeo- 
stasis (for example, cadmium-induced kidney disease) and hormonal 
imbalance (for example, through endocrine disruption; Table 1). 

The susceptibility to exposure-related disease is, however, de- 
termined by mechanisms of functional genetics, epigenetics and 
metagenomics at the interface between risk factors and neurological 
outcomes (Fig. 2). 

It is increasingly acknowledged that genetic and epigenetic factors, 
including the effect of maternal stress on brain function, influence the 
effect of environmental exposure***’. For example, the F4 allele of the 
APOE gene that is reportedly associated with higher risk of late-onset 
Alzheimer’s disease, although not in people from sub-Saharan Africa 
and with a mild association among Hispanic people, is associated with 
protection against early childhood diarrhoea and its related cognitive 
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Figure 1 | Environmental (chemical and non-chemical) threats to brain health in low- and middle-income countries. Multiple sources of exposure (air, water and 
food) coexist, and malnutrition and vector-borne diseases, notably infections, compound the risk of brain disease. Co-exposures not shown include heat, psycho- 


logical stress and a poor physical environment, such as crowding. 


impairment® **. One example of gene-environment interactions is the 
relationship between air pollution components and the gene encod- 
ing the MET receptor tyrosine kinase. Several studies have implicated 
MET as an autism risk gene****. Stratification of the risk conferred by a 
functional promoter variant in this gene (rs1858830) and by local traf- 
fic-related air pollution (regional particulate matter less than 10 mi- 
crometres in diameter and nitrogen dioxide exposure) revealed signif- 
icant multiplicative interaction between the risk genotype and the air 
pollution exposure®. 

Our knowledge of the pathways that lead to late onset of expo- 
sure-related neurological disease is still sparse*”**. Many studies sug- 
gest that the genetic and environmental causes of late onset diseases 
act in parallel and share common molecular mechanisms*’. A number 
of studies have supported the concept that early-life exposure to pol- 
lutants reprograms global gene expression in old age through epige- 
netic mechanisms *’. Variation in exposure response, even among 
individuals exposed to the same environment could be due not only 
to early-life exposures, but also to differences in genetic make up™ °°. 
The extent and nature of exposures and related brain diseases in LMICs 
provide opportunities to explore and overcome the long reach that 
childhood exposure has into adulthood, as well as provide us with new 
advances in environmental health sciences*. 

Exposure-related neurological deficits in LMICs range from pe- 
ripheral neuropathies to a large number of acute, subacute or chron- 
ic central nervous system diseases. Deficits may occur prenatally, or 
during childhood or adolescence, and may be carried through to old 
age. Clinical implications include, but are not limited to, neural tube 
defects, learning disabilities, behavioural problems, psychiatric dis- 
orders, cognitive decline and the occurrence of distinct entities such 
as neurolathyrism, tropical ataxic neuropathy, ALS/PDC and kon- 
7.0.20:30,35,68, 69 (Fig. 3). 

The human microbiome may be of particular interest to the mech- 
anistic understanding of exposure-related diseases in LMICs because it 
may influence the burden of heavy metals”, the metabolism of food- 
borne neurotoxicants such as cassava cyanogens'*, and the outcome of 
enteric diseases in early life, including the child’s neurodevelopmental 
potential” ”- 


RESEARCH AND CAPACITY BUILDING 
Recent advances in environmental health sciences have elucidat- 
ed the myriad risk factors and mechanisms of brain damage that are 
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associated with environmental exposures. The existence of uniquely 
exposed populations in LMICs offers invaluable opportunities to ad- 
vance our current understanding of brain responses to environmental 
threats. In some instances, well-characterized neurotoxicants may be 
used as chemical probes to dissect the pathophysiology of the nerv- 
ous system. However, challenges at the population level still remain, 
including setting exposure limits and developing metrics and meth- 
odologies to assess the long-term impact of environmental exposures 
on disease burden in LMICs and, therefore, globally. Climate change 
and mining of rare elements, which may include radioactive materials, 
present unpredictable risks, and should be added to the environmental 
health research agenda. The toll of such exposures on the global bur- 
den of disease may be efficiently addressed only through competent 
partnerships and alliances established on a global scale and focused on 
key areas and priorities (Box 1). Although there is evidence that some 
of these are already in place, more research and research capacity is 
needed to continue this agenda to improve human health, globally. 


ONE HEALTH-GLOBAL HEALTH DIMENSIONS 

Environmental degradation and contamination, changes in climate and 
ecosystems, and vector-born pathogens or neurotoxicants are the pri- 
mary environmental threats to human life and intellectual performance. 


BOX 1 | INTERWOVEN RESEARCH AREAS AND IN- 
VESTMENT PRIORITIES IN GLOBAL ENVIRONMEN- 
TAL RESEARCH 


¢ Epidemiology and statistical modelling for exposure and risk 
assessment in co-exposure and co-morbidity scenarios 

e High-throughput ‘-omic’ methodologies 

¢ Bioinformatics and knowledge management 

e Development of diagnostic and remediation tools — validation 
and implementation of environmental sensors, detectors and 
biomonitors of exposure and related outcomes 

e Development of metrics and methodologies to assess the long- 
term impact of environmental exposures on neurological disease 
burden 

e Understanding the pathways that lead to late onset of exposure- 
related neurological diseases 

e Training and capacity building in the areas listed above 
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Figure 2 | Environmental framework and pathways to environmentally induced 
neurological disease in low- and middle-income countries. Susceptibility to 

neurological disease is determined at the interface between a particular expo- 
sure, epigenetic and metagenetic make up, and the presence of co-morbidities. 


Humans, plants and animals adapt to environmental challenges, but 
some may overcome their adaptive capabilities and create imminent 
risks for all”. Strategies to promote human health will therefore re- 
quire a serious commitment to trans-disciplinary work, plant and ani- 
mal health and building capacity on a global scale”. 
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Figure 3 | Neurocognition deficits in konzo, a disease linked to eating cyano- 
genic cassava. a, Spasticity in a 14-year old boy severely affected by konzo. b, 
Deficits in mental processing are evident from the results of a neuropsycho- 
logical test. 
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Global neurotrauma research challenges and 
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Traumatic injury to the brain or spinal cord is one of the most serious public health problems worldwide. The devastating im- 
pact of ‘trauma’, a term used to define the global burden of disease related to all injuries, is the leading cause of loss of human 
potential across the globe, especially in low- and middle-income countries. Enormous challenges must be met to significantly 
advance neurotrauma research around the world, specifically in underserved and austere environments. Neurotrauma research 
at the global level needs to be contextualized: different regions have their own needs and obstacles. Interventions that are not 
considered a priority in some regions could be a priority for others. The introduction of inexpensive and innovative interventions, 
including mobile technologies and e-health applications, focused on policy management improvement are essential and should 
be applicable to the needs of the local environment. The simple transfer of a clinical question from resource-rich environments 
to those of low- and middle-income countries that lack sophisticated interventions may not be the best strategy to address these 
countries’ needs. Emphasis on promoting the design of true ‘ecological’ studies that include the evaluation of human factors in 
relation to the process of care, analytical descriptions of health systems, and how leadership is best applied in medical communi- 


ties and society as a whole will become crucial. 
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he global burden of disease (GBD) related to all injuries or ‘trau- 

ma’ is the leading cause of loss of human potential around the 

world especially in low- and middle-income countries (LMICs). 
According to the 2010 report of the GBD study 89% of trauma-related 
deaths occur in LMICs. Nearly 6 million people die each year as a re- 
sult of trauma, accounting for 10% of the world’s deaths — 32°% more 
than the number of fatalities from malaria, tuberculosis and HIV/AIDS 
combined. 

Within the spectrum of trauma-related injuries, traumatic brain in- 
jury (TBI) and spinal cord injury (SCI) are the largest causes of death and 
disability, leading to suffering by, and costs to, the individual, their fam- 
ily and society. Social costs include changes to the family care structure 
owing to cognitive, emotional and/or physical disabilities in addition to 
economic costs and reduced productivity. The incidence of central nerv- 
ous system (CNS) injuries varies between regions, with estimates ranging 
from 200 to 600 injuries per 100,000 people. The data are sparse and the 
true incidence of both TBI and SCI may be considerably underestimated’. 

Efforts to quantify the magnitude of TBI are hindered by several 
factors, the most common of which is related to the lack of consist- 
ent data recording where this occurs*°. For example, the absence of 
formal injury surveillance or reporting systems (trauma registries) in 
some high-income countries as well as in LMICs, leads to an under- 
estimate of the true magnitude of CNS burden of disease worldwide. 
Despite 89° of the trauma population being in LMICs, pre-hospital 
mortality for CNS injuries is not systematically recorded in research 


that originates in these countries. Even fewer LMICs have formally im- 
plemented a data-specific registry for neurotrauma. In addition, most 
patients with TBI have mild to moderate injury and are therefore often 
not reported®’. CNS injuries in patients with multiple trauma, espe- 
cially as a result of military or civilian conflicts, may be recorded under 
other causes of death or injury statistical codes. 


EPIDEMIOLOGY AND GLOBAL RESEARCH IN TBI 
Although high-quality worldwide data of TBI incidence and prevalence 
are difficult to find, neurotrauma registries from high-income countries 
indicate that around 5.3 million people in the United States and nearly 
7.7 million people in Europe are living with TBI-related disability. The 
2010 GBD study* shows that in high-income countries an important 
cause of TBI is motor vehicle accidents, and that there has been a shift 
in the age of the affected population towards older groups. In LMICs, 
those with TBI are generally young adult pedestrians, cyclists or mo- 
torcyclists. In regions where the prevalence of armed violence is high- 
er (Central America, the Middle East and Central Africa), assault and 
gunshot injuries are important causes of TBI’. Deficits associated with 
TBI, including impaired attention, poor executive function, depression, 
impulsivity, poor decision-making and aggressive behaviour, have 
particularly striking social and economic consequences for individuals, 
families and the development of societies as a whole’. An example of 
the heterogeneity of the data in international epidemiological research 
in TBI is shown in Supplementary Table 1. 
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Global Health, Harborview Medical Center, Department of Neurosurgery, 325 Ninth Avenue, Seattle, Washington 98104, USA. “Surgery Critical Care Medicine and 
Clinical Translational Sciences Director of Global Surgery, University of Pittsburgh, 200 Lothrop Street, Pittsburgh, Pennsylvania 15213, USA. 
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Basic and clinical research in TBI have been focused on under- 
standing the biological process of the disease, developing advanced 
diagnostic tools, minimizing secondary brain injury and improving 
treatment guidelines. Unfortunately, the evidence generated from 
neurotrauma studies carried out in high-income countries does not 
always translate to LMICs, where the health infrastructure (including 
providers and facilities) is limited, creating a different context for care 
practice“. Recently published consensus statements, established in 
high-income countries, do not take into account the unique challenges 
that neurotrauma researchers may face in LMICs. Most of these evi- 
dence-based recommendations are best applied in well-funded and 
well-equipped neurosurgical or neurotrauma centres®’. The recent 
Benchmark Evidence from South American Trials: Treatment of In- 
tracranial Pressure (BEST Trip) trial was based on standard recommen- 
dations for randomized clinical trials in high-income countries'*~”. 
Results were far from expected because advanced monitoring tools 
used to guide treatments in high-income countries were not as suc- 
cessful in LMICs, and discussion within the global neurotrauma sci- 
entific community emerged after the publication of this study*?’. The 
interpretation and implications of the study for the neurotrauma field 
in high-income countries and LMICs are still being analysed**. The 
applicability of high-income-country clinical research standards in 
LMICs is an important topic for future international research. 

Research will focus on new trends for TBI care, including, but 
not limited to, the use of hyperosmolar fluids, blood components for 
early resuscitation and other strategies aimed at improving resusci- 
tation in patients with multiple injuries, including TBI?’ *. Addition- 
al aspects that are more applicable to LMICs, such as the importance 
of data collection (neurotrauma registries), capacity building for ad- 
vanced education in neurotrauma-care provision and research, and 
integration of teams within a trauma-care system have been recently 
proposed**s. Treatment strategies such as prophylactic hypothermia 
have also been considered as therapies with the potential for further 
research in LMICs. However, this would require organizational effort 
by the health-care systems to obtain reliable evidence. Multicentre 
collaborative approaches towards data collection are already in place in 
high-income countries; such endeavours may also be an efficient and 
productive strategy for TBI research in LMICs. Alcohol and substance 
misuse associated with TBI is a further key topic that needs to be ad- 
dressed in LMICs. Poorer outcomes have been associated with alcohol 
and substance misuse in high-income countries, but the findings were 
mixed and further research is required in different contexts**. 


EPIDEMIOLOGY AND GLOBAL RESEARCH IN SCI 

In 2013, the World Health Organization (WHO) and the Internation- 
al Spinal Cord Society (ISCoS) joined together to report SCIs world- 
wide*°, Unsurprisingly, similar to TBI, information in epidemiolog- 
ical studies from LMICs was limited. Since 2000, at least 7 papers have 
reviewed the epidemiology of SCI around the world*’? and 2 papers 
have focused on the epidemiology of SCI in LMICs***°, Common con- 
clusions relate to the lack of information available in LMICs owing to 
the absence of SCI registries — paradoxically, these are regions where 
incidence of the disease is high according to observational studies. Re- 
ported incidence ranged from 12 to close to 60 cases per million inhab- 
itants in different countries (see Supplementary Table 2). It is difficult 
to compare the data owing to the heterogeneity of the studies, which 
had diverse methods for reporting and classifying the disease. 

SCI registries from high-income countries and meta-analysis of 
studies reporting incidence of the disease allow us to estimate that 
worldwide, every year nearly 250,000 to 500,000 people sustain an 
SCI**4’, Historically, around 90% of SCIs have been associated with 
trauma; however, in an analysis of high-income-country registries, 
non-traumatic SCI has recently increased to beyond 10%. The trau- 
matic SCI population is young, especially in LMICs. Underestimation 
of SCIs is common and with the exception of a few countries that 
have countrywide registries (Finland, Canada and the United States), 
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incidence estimates are extrapolated from city or regional data that 
may not be representative of the countries as a whole*. 

The three most common causes of SCI across the world are road 
traffic accidents, falls and violence. Because of the low quality of data, 
especially in LMICs, there may be country-level variation in the causes 
or the context of injury, especially in areas with higher levels of so- 
cial violence (Central America, the Middle East and Central Africa). In 
studies from Africa, transportation-related events account for nearly 
70° of cases; in countries affected by war such as Afghanistan, around 
60% of all SCI cases are related to violence. It has been estimated that 
work-related injuries contribute to at least 15% of all traumatic SCIs. 
There are consistently higher incidence rates in adult males, and the 
two groups most likely to have an SCI are young and elderly males. 
Life expectancy for patients with an SCI is shorter than the average 
in LMICs, as well as in comparison with patients with an SCI from 
high-income countries**. 

Inasimilar way to TBIs, most SCI research is carried out in high-in- 
come countries, and focused on the basic science of the biological pro- 
cess of the disease, helping to develop treatment guidelines and the 
application of advanced technology for nerve reconstruction, sophis- 
ticated prosthesis and advanced rehabilitation. Many of these studies 
are not feasible in LMICs where basic science resources are scarce and 
advanced rehabilitation is almost non-existent. Recommendations by 
researchers from high-income countries for designing SCI clinical tri- 
als have been published, but the applicability of these recommenda- 
tions to the LMIC context has yet to be determined’. Organization of 
neurotrauma-care systems and capacity building for neurotrauma and 
SCI registries may have an effect in LMICs, but they are not priorities 
for researchers from high-income countries. Crucial aspects, such as 
the relationship between pre-hospital care and outcomes for patients 
with SCI are difficult to analyse in LMICs because pre-hospital care is 
not widely available’. An example of the difficulties in SCI research 
owing to a lack of data is presented in a review about pressure ulcers 
as a complication in patients with SCI in LMICs*!. Over the past few 
years, researchers in China have been making important steps towards 
evaluating cellular therapies and improving the quality of life of those 
with SCI. Registries are now available to improve epidemiological data 
collection within the country”. 


Lessons learned from clinical neurotrauma research 
In this section, we present examples of active neurotrauma research 
groups from LMICs. 


Latin America. The three examples from Latin America draw upon 
our direct experience of working in Argentina, Bolivia, Colombia and 
Ecuador. 

Between 2008 and 2011 a randomized controlled trial of intracra- 
nial-pressure (ICP) monitoring in patients with TBI, which compared 
the management of patients with severe TBI that was based on infor- 
mation from ICP monitoring with treatment that was based on imag- 
ing and clinical examination without ICP information, was performed 
in Bolivia and Ecuador. The study reported no difference in outcomes 
between these groups. The study is considered to be class 1 — it has 
high internal validity. Thus, for LMICs, the study provided concrete in- 
formation on which to base resource-allocation decisions, and docu- 
mented the clinical success of a treatment approach that is sustainable 
in low-resource environments. Sufficiently skilled clinical staff with a 
better organized protocol of care could produce good recovery results 
in the intensive care unit (ICU) without data from an ICP monitor by 
using clinical assessment to manage intracranial hypertension® °°. 

A multicentre randomized controlled trial of post-discharge care 
of paediatric traumatic brain injury in Argentina aimed to develop, 
introduce and test a family-provided, post-discharge intervention for 
children with complicated mild, moderate and severe TBI. Multiple 
research methods were used, beginning with focus groups with chil- 
dren who had sustained a TBI, as well as with their parents, physicians, 
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nurses and social workers. The focus group experience was one of the 
most important aspects of this project. It gave the high-income-coun- 
try research team an appreciation of the realities of TBI in these com- 
munities, and allowed for an ecologically relevant approach to devel- 
oping an intervention. The participating hospitals elected to maintain 
the intervention, and said that the protocol improved overall quality 
of care for the children and their families. An unexpected finding was 
that despite reports that paediatric TBI is a serious problem in Latin 
America®’, hospitals in this study saw, on average, fewer than one child 
with TBI per month. 

In another study, a standardized trauma-care protocol decreased 
in-hospital mortality of patients with severe TBI in a LMIC teaching 
hospital®*. The standardized trauma-care protocol was based on gen- 
erally accepted best practices; damage-control resuscitation strategies 
were proposed based on military protocols from war scenarios in the 
Middle East. With the knowledge that most hospitals in LMICs have 
financial or logistical limitations in building evidence-based proto- 
cols and do not have a pre-existing trauma registry, an administrative 
electronic database was adapted to capture clinical information about 
adults with TBI. Adherence to the protocol was limited — around 
60%. Surprisingly, the barriers to adherence were not associated with 
resources or technology, but with human factors related to changing 
established practices. How to create motivational interventions to 
change practice is an important research question for LMICs. 


China. Here we summarize studies of the use of decompressive 
craniectomy and hypothermia in the treatment of severe TBI 
conducted in China. 

Ina study of decompressive craniectomy, the influence of a stand- 
ard, larger, unilateral frontotemporoparietal bone flap (a standard 
trauma craniectomy) was compared with a limited, smaller temporo- 
parietal bone flap (a limited craniectomy) based on a 6-month Glasgow 
Outcome Scale (GOS) score and complications. The investigators found 
significantly greater mortality in patients with a limited craniectomy”’. 
A second study in China compared 1-month mortality, complications 
and the 1-year GOS score of larger unilateral decompressive craniecto- 
my with routine unilateral temporoparietal decompressive craniecto- 
my. One-month survival and one-year GOS scores were significantly 
better in the larger unilateral decompressive craniectomy group; how- 
ever, they had a higher rate of complications”. These findings con- 
tribute important information for LMIC environments where decom- 
pressive craniectomy may be the only treatment option available for a 
quick resolution of ICP. 

Similar to decompressive craniectomy, prophylactic hypothermia 
for the treatment of ICP is a relatively low-technology option available 
in LMICs. However, its influence on patient outcome is yet to be clearly 
demonstrated. Ina comparison of a longer course of mild hypothermia 
(33-35°C for 3-14 days) with normothermia, mortality was found to be 
lower and 1-year GOS scores were better for the hypothermia group”. 
A subsequent study that compared short-term (1-3 days) and long- 
term (4-6 days) mild hypothermia found that patients given a course 
of hypothermia for 5 days had significantly better 6-month GOS scores 
than those given a 2-day course®. Finally, patients who received sys- 
temic cooling (full body) were compared with those who received 
selective brain cooling (head only) and normothermia. Pneumonia 
rates were lowest and 2-year GOS scores were highest in patients who 
received selective cooling**. Although the results of these studies are 
promising, the findings are tempered by contradictory findings from 
similar studies conducted in other countries. What is important for re- 
search in LMICs is the question of how to manage a crucial aspect of 
TBI, ICP, in the absence of technological resources. 


India and Nepal. Recent studies from India and Nepal describe the 
experience of creating surveillance and research infrastructures in 
extremely austere conditions. 

In India, the WHO’s Standards for Surveillance of Neurotrauma 
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were used to design and build a simple data collection instrument, and 
an observational study of TBI was conducted in a rural teaching hospi- 
tal. Over 6 months, data on 414 patients were collected and descriptive 
statistics about a sample were reported. Logistical difficulties, includ- 
ing a lack of closely managed data collection and entry, inconsistent 
coding and missing data were recorded®. In an epidemiological study 
of trauma in a hospital in the Eastern region of Nepal, data on 6,793 pa- 
tients over 1 year were collected, and a subset of TBI cases was reported 
on. This is the first study in Nepal that collected comprehensive patient 
profiles and reported outcomes in detail®*. The authors concluded that 
trauma-related injury significantly contributes to morbidity and mor- 
tality and is the third leading cause of death in the region. 

The examples from Latin America, China, and India and Nepal il- 
lustrate the vast differences in the spectrum of neurotrauma research 
across LMICs. Neurotrauma research at the global level needs to be 
contextualized — different regions have their own needs and challeng- 
es during the research process. Certain interventions may be high pri- 
ority in one country, but low priority in another. 


Research priorities, opportunities and challenges 
Although CNS injury is important, we must acknowledge that an iso- 
lated brain or spinal cord injury represents a small fraction of the bur- 
den of trauma as a disease, but they occur frequently in the context 
of the multiple-injury patient. From a mechanistic viewpoint, isolated 
CNS trauma is the best model to understand the pathophysiology of 
brain or spinal injury, but it is naive to ignore the fact that patients with 
multiple trauma injuries have a conglomerate of systemic events that 
affect the brain. We must, therefore, study CNS injury in the setting 
in which it most commonly occurs: the patient with multiple injuries. 
This context needs to be part of the research portfolio in global health, 
especially in LMICs where it is difficult to measure the interactions of 
different interventions in the same patient. Assessment could, howev- 
er, improve with better organization of the existing resources. 

The impact of new resuscitation techniques, early use of blood 
products and early evaluation of coagulopathy in patients with TBI or 
SCI could be key in areas where violence is an important cause of in- 
jury and transport times to hospital are long. The impact of non-in- 
vasive intracerebral blood detectors, advanced airway management 
by non-physicians, and pre-hospital resuscitation fluids could be a 
priority in areas where organized trauma systems do not exist. Most 
LMICs do not have organized pre-hospital care. If it exists, it is not 
consistent and there are no evidence-based transport protocols. There 
is little training for ambulance staff, which may or may not include a 
physician. If physicians are present, often they are not trained in emer- 
gency medicine. It is possible that in LMICs the most important area of 
research is within the public health system in order to demonstrate the 
benefits of an organized pre-hospital care system to improve patient 
outcomes, and to reduce costs both in hospital and post-discharge. 
Establishing systematic surveillance systems to accurately identify in- 
cidence, prevalence, processes of care and outcomes following TBI and 
SCI are essential priorities for research in areas where these systems 
do not exist. 


Capacity-building priorities and opportunities 

Countries where TBI and SCI are a significant burden of disease are also 
those with substantial gaps in services that affect the entire spectrum 
of trauma care, including prevention, pre-hospital care, specialized 
neurotrauma care, rehabilitation, quality control and research. As 
daunting as it may seem to propose capacity-building activities in all 
these areas, the comprehensive management of TBI and SCI will re- 
quire human resources, infrastructure and research training aimed at 
enhancing capacity in all these components. Because resources are 
limited, the next fundamental question is how to establish priorities so 
that these areas can advance in parallel. Research training grants and 
collaborative research between partners in LMICs and high-income 
countries should include the creation of multidisciplinary teams of 
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health-care professionals that work in prevention, pre-hospital care, 
clinical care and clinical epidemiology to expand overall human-re- 
source capacity. At the same time, basic training in provision of care is 
gravely needed, as is research training. The capacity-building curric- 
ulum for different parts of the health system will differ, but they must 
all introduce the concepts that are inherent to the creation of a com- 
prehensive trauma system. 

Specific capacity-building activities should address the design and 
feasibility of using modular theme-based trainee curricula that em- 
ploy enabling technologies such as e-learning and teleconferencing; 
low-tech clinical simulation training that emphasizes early life-saving 
interventions or procedures; and team-training techniques to accen- 
tuate the collaborative nature of neurotrauma care. Strengthened ca- 
pacity to use information and communication technologies to support 
research and research-training programmes is also needed. 


CONCLUSION 


Enormous challenges must be met to significantly advance neurotrau- 
ma research worldwide, particularly in underserved areas and austere 
environments. Experts beyond clinical practitioners and basic science 
researchers will need to participate in order to meet these challeng- 
es. The introduction of inexpensive and innovative interventions, in- 
cluding communication technologies, mobile-health applications and 
policy management approaches that meet the needs of a particular 
local environment is the ultimate goal. Simply transferring a clinical 
question from a resource-rich environment to that of a LMIC which 
lacks sophisticated interventions may not be the best strategy to ad- 
dress the needs of LMICs. Furthermore, the findings of studies con- 
ducted in resource-rich environments may not necessarily result in 
evidence-based guidelines that can be implemented in health-care 
scenarios with more-limited resources. 

Anew context for capacity building in neurotrauma should include 
broad international collaborations and global-health opportunities 
directed at creating not only advanced researchers, but also health 
leaders who work in field research and health-policy development and 
implementation. Fundamental questions in research that are relevant 
to LMICs need to go beyond health-care facilities and medical schools. 
Emphasis on promoting the design of true ‘ecological’ studies that in- 
clude evaluation of human factors in relation to the process of care, an- 
alytical descriptions of health systems, and how leadership is applied 
in the medical community and society as a whole will be crucial. 
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The characteristics of neurological, psychiatric, developmental and substance-use disorders in low- and middle-income coun- 
tries are unique and the burden that they have will be different from country to country. Many of the differences are explained 
by the wide variation in population demographics and size, poverty, conflict, culture, land area and quality, and genetics. Neuro- 
logical, psychiatric, developmental and substance-use disorders that result from, or are worsened by, a lack of adequate nutrition 
and infectious disease still afflict much of sub-Saharan Africa, although disorders related to increasing longevity, such as stroke, 
are on the rise. In the Middle East and North Africa, major depressive disorders and post-traumatic stress disorder are a primary 
concern because of the conflict-ridden environment. Consanguinity is a serious concern that leads to the high prevalence of re- 
cessive disorders in the Middle East and North Africa and possibly other regions. The burden of these disorders in Latin American 
and Asian countries largely surrounds stroke and vascular disease, dementia and lifestyle factors that are influenced by genetics. 
Although much knowledge has been gained over the past 10 years, the epidemiology of the conditions in low- and middle-in- 
come countries still needs more research. Prevention and treatments could be better informed with more longitudinal studies 
of risk factors. Challenges and opportunities for ameliorating nervous-system disorders can benefit from both local and regional 
research collaborations. The lack of resources and infrastructure for health-care and related research, both in terms of person- 
nel and equipment, along with the stigma associated with the physical or behavioural manifestations of some disorders have 
hampered progress in understanding the disease burden and improving brain health. Individual countries, and regions within 


countries, have specific needs in terms of research priorities. 
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s outlined in the introduction to this series (see page S151), the 
proportion of the global burden of disease (GBD) due to neu- 
rological, mental health, developmental and substance-use 
(NMDS) disorders is rising worldwide'. The type of disorder and rea- 
son for increase varies across countries’, regions and populations as 
indicated by the regional differences in disability adjusted life years 
(DALYs; a metric developed to take both mortality and morbidity 
measures into account). DALYs for a disease or health condition are 
calculated as the sum of the years of life lost (YLL) due to premature 
mortality in the population and the years lost due to disability (YLD) 
for people living with the health condition or its consequences (http:// 
www.who.int/healthinfo/global_burden_disease/metrics_daly). The 
first regional use of DALYs, the regional patterns of disability-free life 
expectancy and disability-adjusted life expectancy, were reported by 
the Global Burden of Disease Study?. 
Opportunities to ameliorate nervous system disorders could be 
increased by both local and regional research collaborations. Lessons 
learned locally, and those learned in collaboration across regions and 


countries, may be adapted and applied to other areas, there may also 
be opportunities to leverage resources. Some disorders have physical 
boundaries, whereas others have sociocultural and economic contexts. 
‘Thus, the challenges faced in high-income countries are often different 
from those in low- or middle-income countries (LMICs) in type, char- 
acteristic or scale. Population demographics, genetics, income, reli- 
gion, culture, language, ethnic origin, conflicts, land area and quality, 
and population size vary widely between and within LMICs. Although 
there is some commonality in the prevalence of certain brain disorders 
(Fig. 1), significant diversity exists with respect to the origin, mani- 
festation and treatment strategies or options adopted across these re- 
gions. In this Review, we focus on sub-Saharan Africa, the Middle East 
and North Africa, Asia, South and Southeast Asia and Latin America*”. 
We introduce a regional perspective with respect to NMDS disorders, 
highlighting what has been learned from epidemiological differences 
between LMICs as well as globally, while identifying specific needs, 
research priorities and the opportunities for collaboration among dif- 
ferent LMICs (Tables 1-4). 
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SUB-SAHARAN AFRICA 

Malnutrition, from birth through to adulthood, seems to be the most 
significant contributor to disease burden and disability in sub-Saharan 
Africa’. Maternal malnutrition, including micronutrient deficiencies 
such as vitamins and iodine, impairs the development and function 
of the nervous system of offspring, and negative effects can persist 
in the next generation®. Other forms of maternal and environmental 
trauma during the perinatal period affect brain development and cause 
long-term changes in brain function. Neurological disorders caused 
by eating toxic foodstuffs are unique to sub-Saharan Africa. Cassava 
is an important food crop that contains endogenous neurotoxins and, 
if not properly prepared, can cause konzo — a peripheral polyneurop- 
athy with prominent sensory loss and ataxia. Lathyrism that presents 
as spastic paraparesis is an equally debilitating neurological disorder 
caused by excessive ingestion of the grass pea Lathyrus sativus that 
contains the excitotoxic amino acid, S-N-oxalyl amino-L-alanine’. 

Use of psychostimulants is another major contributor to the bur- 
den of brain disorders in sub-Saharan Africa’. Of particular concern 
is the high prevalence of maternal alcohol and methamphetamine use 
in areas such as the Western Cape Province of South Africa. The inci- 
dence of fetal alcohol syndrome in some locations within this region 
is the highest in the world®. The increase of methamphetamine use in 
pregnant women in the Western Cape is of concern given the negative 
effects that the drug has on the developing fetus’. Khat use is of con- 
cern in East Africa!°", where 60-90% of men use the drug daily?’. 
The consequences of habitual khat consumption include behavioural 
disturbances and toxic psychosis, which has a particular impact on the 
overall health of young adults. 

The prevalence and incidence of epilepsy in sub-Saharan Africa 
countries is twofold higher than that of other countries“. The preva- 
lence varies between 4.5 and 20.8 per 1,000 people, owing to the local- 
ized and high incidence of parasitic infections, poor perinatal care and 
poor access to treatment. The full burden of epilepsy in sub-Saharan 
Africa is difficult to assess and is likely to be under-reported because 
people with epilepsy are stigmatized and frequently left untreated’. 
Stroke is another concern among non-communicable disorders within 
sub-Saharan Africa — incidence is increasing at an alarming rate”’. The 
prevalence of dementia in sub-Saharan Africa is reportedly much low- 
er than in other regions”. However, these reports may not be a true 
reflection of the prevalence, which it is projected to increase with an 
increase in lifespan. Furthermore, as research extends into rural areas, 
diagnosis of unreported cases may reveal the true burden. 

Sub-Saharan Africa has the highest burden of infectious diseases 
and the poorest public health infrastructure in the world®*’. Parasitic 
infections are also highest in this region and often have neurocognitive 
sequelae. HIV-associated neurological disorders area major burden, 
with more than 1.5-million children living with HIV and at risk of de- 
veloping HIV-associated cognitive impairment and dementia!™. Little 
is known of the effects of HIV and antiretroviral treatment on the de- 
veloping brain. There is an urgent need for research on the longitudinal 
trajectory of neurodevelopment among children and adolescents who 
are perinatally infected with HIV. Cognitive and psychiatric problems 
have been found to decrease antiretroviral treatment adherence and 
survival of adults with HIV in Zambia*. Neuroimaging and neurocog- 
nitive testing are well established in several regions within sub-Saha- 
ran Africa and have been used in cross-country collaborations to fur- 
ther our understanding of the spectrum of neurocognitive disorders in 
patients with HIV and to determine the effect of antiretroviral therapy 
on these individuals**. Subtle changes in white-matter integrity have 
been used for early diagnosis and monitoring progression of neurolog- 
ical disease in individuals with HIV”. 


MIDDLE EAST AND NORTH AFRICA 

Many of the aetiological and treatment features of psychiatric disorders 
in the Middle East and North Africa are due, in part, to the unique envi- 
ronmental and cultural influences within the region. Over the past few 
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Figure 1 | Comparison of disability associated life years (DALYs) between 
high-income and low- and middle-income countries. The data were derived from 
the World Health Organization (http://www.who.int/healthinfo/global_bur- 
den_disease/metrics_daly/) and refs 4, 5. 


decades, communities have been exposed to traumatic events includ- 
ing anti-government uprisings and wars, which has left many pop- 
ulations vulnerable to mood disorders, such as post-traumatic stress 
disorder (PTSD) and major depressive disorder (MDD). In comparison 
with the global estimate of 4.4% (ref. 27), depression prevalence in 
Iraq is 7.2% and is 15.3% in the Palestinian territories**”’. In fact, MDD 
is currently listed among the top three causes of YLDs in most of the 
countries within the Middle East and North Africa’. The statistics are 
similar for PTSD within the region. 

Owing to the high rate of consanguinity in the region, the in- 
cidence of several recessively inherited genetic disorders, such as 
inherited deafness, is increasing®”**. For example, Bardet-Biedl 
syndrome, which includes many nervous system abnormalities, 
is common in most of the Arab countries, particularly in Kuwait. 
Whereas the syndrome typically affects 1 in 150,000 people in North 
America and Europe, the prevalence in Arab countries ranges from 1 
in 13,500 to 1 in 30,000 people*’. A national strategy is needed in this 
region to address this burden of genetic disease. Although services 
such as genetic screening exist, understanding the barriers to access 
and use requires implementation research and an understanding of 
sociocultural norms. This will help health workers to tailor services 
and educational campaigns that are culturally acceptable. 

The prevalence of substance-use disorders varies between 7.25% 
and 14.5%, with cannabis being the most commonly used drug fol- 
lowed by alcohol**’. Khat is also widely used as a stimulant in Yemen 
and the neighbouring countries within the Arabian Peninsula. 

There is a need for population-based prevalence estimates of 
common neurological disorders in the Middle East and North Af- 
rica, with a special emphasis on epilepsy, because systematic epi- 
demiological studies of epilepsy in Asia and Africa have not includ- 
ed this region**. Most published studies only report hospital-based 
samples*. For example, a review of seizure disorders in Arab coun- 
tries indicated a median prevalence of 2.3 per 1,000 people (range, 
0.9-6.5 per 1,000). These figures are very likely to underestimate 
the prevalence in a population of more than 350-million peo- 
ple*, particularly because epilepsy is stigmatized within several 
communities*”. 
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LATIN AMERICA AND THE CARIBBEAN 

Within the countries and territories of Latin America and the Caribbe- 
an (Central America, Mexico and the Latin Caribbean); the non-Latin 
Caribbean and South America there are sub-regional differences in the 
contribution of NMDS disorders to the total burden of disease meas- 
ured in DALYs. Although DALYs owing to neurological disorders, in- 
cluding stroke, are low in the Andean Latin American sub-region, they 
are higher in the southern Caribbean sub-regions and even higher in 
tropical Latin America and the Caribbean. However, if one considers 
the total region, the burden of NMDS disorders accounts for 22.2°% of 
the total DALYs. The overall weighted prevalence of mental health dis- 
orders in children in the region (12.7%) is significantly more than the 
prevalence (9.7%) seen in United Kingdom when similar diagnostic 
procedures are used**. Importantly, there is inadequate information on 
risk and protective factors that affect the incidence of mental health 
disorders in children living in developing countries in general and Lat- 
in America and the Caribbean in particular”. 

Unipolar depressive disorders (13.2%) and alcohol dependence 
(6.9%) constitute the most common psychiatric disorders“? in Latin 
America and the Caribbean (Fig.1). The annual level of alcohol con- 
sumption (8.4 litres per capita annually) is the second highest in the 
world after Europe*!. Alcohol consumption has been associated with 
roughly a third of intentional and non-intentional accidents”; trau- 
matic brain injuries incurred from any type of accident have long-term 
implications for society and for the individual, including impaired at- 
tention, depression and economic costs to families”. 

As for other regions the current increasing trend in DALYs for 
non-communicable disorders? suggests that epilepsy and dementia 
are unique in terms of their increasing prevalence. Their prevalence or 
manifestation is increasing in Latin America and the Caribbean. The 
annual incidence of epilepsy according to a collection of 32 commu- 
nity-based studies is 77.7 to 190 per 100,000 people each year**, com- 
pared with 30 to 50 per 100,000 people in high-income countries. 
Distribution of epilepsy across sub-regions of Latin America and the 
Caribbean also differs; one reason for this is the direct association be- 
tween epilepsy and the distribution of neurocysticercosis**. Dementia 
is also widespread***’, but pockets of early onset Alzheimer’s disease in 
families are apparent in Caribbean Hispanic people who originate from 
Puerto Rico or the Dominican Republic”. Studies on familial types of 
dementia in Latin American countries such as Colombia (Alzheimer’s 
disease) and Venezuela (Huntington’s disease) have shown that both 
non-genetic (nurture) and unrelated genetic factors may have a major 
role in influencing phenotypes** °°. This suggests that even highly pen- 
etrant autosomal dominant diseases may be modified by environment 
or lifestyle factors. Although not unique to the region, it is worth noting 
that stroke is the leading cause of death in Ecuador, and in other Latin 
American countries”. Little is known about the prevalence of any of 
these disorders among indigenous Andean or Amazonian populations. 


ASIA 
Sub-regions of Asia comprise East and Southeast Asia, and incorporate 
the Association of Southeast Asian Nations as well as China, whereas 
South Asia consists of sub-Himalayan countries, including Afghani- 
stan, Bangladesh, India, Pakistan and Sri Lanka. About two-thirds of 
the world’s population resides on the Asian continent. India and Chi- 
na, because of their size and economic impact, have a major influence 
on the health and trends of the region, and in shaping global health 
statistics, however they are catalogued. Asia’s ethnic diversity, and 
widely disparate socioeconomic development lead to significant var- 
iations in the prevalence and burden of NMDS disorders. An epidemi- 
ological study of epilepsy in 23 Asian countries revealed the lifetime 
prevalence of epilepsy to be 1.5 to 14 per 1,000 people. Infections of the 
nervous system often contribute to epilepsy and prevention of these 
infections is needed to reduce the burden of the condition. 

Another major concern is the rising prevalence of dementia; al- 
though the number of patients with dementia is predicted to increase 
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by 100% between 2001 and 2040 in developed countries, dementia is 
predicted to increase by more than 300% in India, China, South Asia 
and the Western Pacific region”. In India alone, there are 3.7 million 
people with dementia and the numbers are expected to double by 2030 
(ref. 53). In addition, the high burden of cardiovascular risk factors in 
developing countries, including India, contributes to cerebrovascular 
disease such as vascular dementia™. 

Asia, in particular South Asia, has the highest stroke mortality 
in the world®*. Within Asia, there is a wide variation in stroke preva- 
lence®**. Rural parts of South Asia have lower stroke prevalence than ur- 
ban areas®*, and this needs to be examined further in future research®’. 
In China, the incidence of stroke differs geographically. A higher inci- 
dence of stroke is seen in northern and western areas, and is associat- 
ed with a higher prevalence of hypertension and obesity**. Barriers to 
preventing and reducing mortality and disability due to stroke are the 
lack of infrastructure, such as dedicated stroke care units, and aware- 
ness*’. 

Tobacco use — a leading cause of stroke — is a major public health 
issue for East and Southeast Asia. Half of the world’s tobacco consump- 
tion takes place in Asia*’. Men are more likely to smoke than women; 
and prevalence rates for males range across countries from 36% in Sin- 
gapore to 64% in Laos®. Although the neurological and other health 
implications of smoking are well known, many Asian people still 
smoke. Public health measures to reduce smoking are just beginning; 
for example, in June 2005 and October 2008, India and Beijing banned 
indoor smoking in public places and offices, respectively. 


COMMON RESEARCH NEEDS AND CHALLENGES 

There are several commonalities within LMICs in terms of disease prev- 
alence and the public health and research challenges, although consid- 
erable ethnic and geographical diversity exists. 


Lack of robust epidemiological studies 

Epidemiological studies, preferably longitudinal, designed to identi- 
fy disease burden and risk or protective factors for NMDS disorders, 
are one of the most important research needs in LMICs. These need to 
be complemented by research on health systems and sociocultural ef- 
fects, and clinical trials to determine the best interventional strategies. 
Furthermore, rapid urbanization and the associated demographic and 
sociocultural changes in LMICs should be studied with respect to their 
impact on the course and outcome of different brain disorders, espe- 
cially mental health illnesses and substance misuse. A careful analy- 
sis of the possible interaction between demographic and sociocultural 
changes, and biological factors is essential to initiate remedial steps to 
contain the progression of these disorders. 


Disproportionate distribution of scientists 

Some countries have a disproportionate share of scientists, with in- 
vestment and output concentrated in only a few places. In general, 
Latin America produces more neuroscience and mental health dis- 
order publications than the Middle East and Africa. Similar variation 
is seen in the number of neuroscience publications produced in Asia 
(Fig. 2). Between 1996 and 2013, India consistently produced the most 
neuroscience and mental health research publications. Figures also 
reveal that 9.2% of institutions in India produce 80.1% of the publi- 
cations. Among Latin American and Caribbean countries, Brazil now 
accounts for more than two-thirds of South America’s entire research 
output, although in terms of articles per capita, it is broadly similar 
to Argentina, Uruguay and Chile. One could leverage this situation by 
promoting intraregional research collaborations to enhance research 
capacity and infrastructure. The top 10 African countries in terms of 
health-research publications from 2000 to 2014 are South Africa, Ni- 
geria, Kenya, Uganda, Tanzania, Ethiopia, Ghana, Cameroon, Malawi 
and Senegal*!. Although these trends comprised all health research, 
it is likely that mental health publications are ranked similarly in 
sub-Saharan Africa. 
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Table 1 | Neurological, mental health, developmental and substance-use disorders and specific burden of disease in sub-Saharan Africa 


Condition or disease 


Nutrition: malnutrition 


Substance use: cannabis, 
methamphetamine, khat, 
alcohol, and opiods or 
heroin 


Stroke 


HIV-associated 
neurological conditions 


Key affected countries 
AlLSSA 


Cameroon, Central African Republic, 
Democratic Republic of Congo, Tanzania, 
Ethiopia and Mozambique 


Ethiopia 


West and Central Africa (notably), and South 
Africa 


South Africa 


Tanzania, Kenya, Uganda, Ethiopia, Eritrea and 
Somalia 


South Africa 


West and Central Africa and South Africa 


AlLSSA 


AlLSSA 


Nigeria, Democratic Republic of Congo, 
Senegal, Central African Republic, Tanzania, 
Zambia and Kenya 


South Africa, Kenya, Nigeria, Zambia, Malawi, 
Cameroon, Botswana and Uganda, 


Burden of disease 


204 million people suffer from hunger**** 


Highest prevalence of stunting in the world is in 
East Africa (42%) and West Africa 36% based on 
the WHO Child Growth Standards** 

22% of children are underweight in West Africa®* 


Reported estimates show there are around 

6,500 cases of cassava toxicity; unofficial reports 
estimate the number of cases to be at least 100,000 
(ref. 85) 


Cannabis use is higher than the global average 
(12.4% versus 3.8%)*” 

Cannabis is the most popular illicit drug followed 
by cocaine 


Increased use of methamphetamine during 
pregnancy*?** 
General increase in drug use*” 


60-90% of East African males use khat daily’****° 


Self-reported prevalence of alcohol abuse is 36.9% 
Fetal alcohol syndrome in the local Western Cape 
population is the highest in the world® 


Annual prevalence of heroin use is above the 
global average®* 

0.92-2.29 million people used opiates in the past 
year*” 


Prevalence varies between 4.5 and 20.8 per 1,000 
people; about twice that elsewhere’*” 


Community-based studies revealed an age- 
standardized annual stroke incidence rate of up 
to 316 per 100,000 of the population, and age- 
standardized prevalence rates of up to 981 per 
100,000 (ref. 92) 

65% of all neurological admissions to hospitals are 
stroke related in the West African sub-region” 


Prevalence is between <1% and 10.1% in 
population-based studies and up to 47.8% in 
hospital-based studies” 


1.5-million children are living with HIV and are 
at risk of developing HIV-associated cognitive 
impairment and dementia** 

Prevalence of HIV-related neurocognitive 
impairment ranged from <1% to 80% in hospital- 
based studies” 


Maternal malnutrition impairs the 
development and function of the nervous 
system of offspring and negative effects persist 
in the next generation® 

Malnutrition in infants and children affects 
their growth and cognitive development** 


Leads to difficulty in walking™ and peripheral 
polyneuropathy with prominent sensory loss 
and ataxia® 


Malnutrition may increase the negative impact 
of food borne toxins and causes irreversible 
spasticity®. 


1 in 18 problem drug users receive treatment; 
most of those in treatment are cannabis users*” 


Structural (volume reductions in the striatum 
and increases in limbic areas of the brain) 
and functional deficits as well as cognitive 
and behavioural abnormalities have been 
described in infants and children exposed to 
methamphetamine prenatally? 

Violent behaviour in adults 

Cognitive dysfunction® 


Chronic khat use may have a long-term 
deleterious effect on working memory” 


Negative effects on the developing fetus? 
Fetal alcohol syndrome, growth retardation 
and cognitive dysfunction® 


An increasing role as a transit area for drug 
trafficking and increased crime rate*” 


Impaired cognitive function due to effect of 
seizures on the developing brain” 
Stigma and social isolation” 


Increased burden to society 


A burden to family and society 


HIV-related dementia is a particular concern, 
and burden, in SSA as people live longer with 
the disease 

Children infected with HIV perinatally do not 
perform as well as non-infected children on 
cognitive tests and are at much higher risk for 
psychiatric disorders later in life 


SSA, sub-Saharan Africa; WHO, World Health Organization. The 2014 population estimates for sub-Saharan Africa were 961.5 million (http://data.worldbank.org/region/SSA) 


Insufficient resources for treatment and research 

Most countries allocate less than 5% of their health-care budget to the 
treatment of brain disorders’. For example the Middle East and North 
Africa, Palestine, Qatar and Egypt, spend only 2.5%, 1% and less than 
1% on brain-disorder treatment, respectively. The number of mental 
health professionals available in most LMICs is also very low. For ex- 
ample, there are only 1.44 psychiatrists per 100,000 people in Egypt. In 
India, 52% of the districts do not have psychiatric facilities, and there 
is an acute shortage of psychiatrists, psychologists and psychiatric so- 
cial workers®>. Hence people with neuropsychiatric disorders remain 
largely undiagnosed and even when they are diagnosed, they do not 
have access to sustainable, affordable treatment and optimal medi- 
cal care®. Although a recent World Bank report indicates that disease 
burden that results from non-communicable causes, including mental 
health disorders, has increased substantially, with major depressive 
disorders at the top of the list (http://www.healthdata.org/gbd/data)? 
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there is a severe lack of resources, particularly of trained personnel and 
training facilities*’. Given the severe fiscal and human-resource con- 
straints for treatment, it is not surprising that research is lagging. The 
current research gap between developed and developing nations is re- 
flected in the mental health research output, with LMICs contributing 
to only 6% of international research articles™. 


Brain drain 

Brain drain is the loss of highly trained people, constituting another big 
challenge to LMICs, and widening the research gap between high-in- 
come countries and LMICs. The reasons cited by researchers for their 
exodus are a dearth of funding, poor facilities, and limited or a lack of 
peer groups to provide intellectual stimulation”. Although it may be 
argued that brain drain is a common problem in LMICs across disci- 
plines, neuroscience research is particularly affected. This is because 
unlike core disciplines such as chemistry, physics or mathematics, 


$201 


REGIONAL RESEARCH | RAVINDRANATH ET AL. 


Table 2 | Neurological, mental health, developmental and substance-use disorders and specific burden of disease in South Asia and Southeast Asia 


Condition or disease Key affected countries 


Burden of disease 


ion or disease 


Mood disorders Vietnam, Cambodia and South Asia countries . Depression is the second most common NMDS . Substantial impact on society in general and family 
disorder in Vietnam (2.8% prevalence) and in particular 
Cambodia (16.7% prevalence); there is a relatively ° Patients with psychiatric disorders under diagnosed 
high prevalence (23.6%) in elderly Chinese®™**° and undertreated due to scarcity of physicians’” 
. Anxiety is the most common NMDS disorder in coupled with absence of evidence for effectiveness 
Cambodia (27.4% prevalence)** of treatment”® 
. Prevalence of 16/1,000 population in India®® ° People with dysthymia have impaired quality of life 
° Unipolar depression ranks among the top 10 and poor marital adjustment”? 
disorders* 
AllAsia ° Predicted to increase by more than 300% in India, ° People with dementia who live with families puts 
China, South Asia and the Western Pacific region’? significant burden on carers. None of the carers 
° 9 million Chinese have dementia’® receive carer benefits and have high levels of 
° The rate in people over 60 was 3.4% in Thailand psychological morbidity’ 
and 3.5% in Indonesia! e Annual cost of dementia in China is US$2,384 per 
° China and India are predicted to have the largest patient annually’” 
number of dementia cases in the next decade!” 
0 An estimated 3.7 million Indians have dementia 
and the numbers are expected to double by 2030 
(ref.102) 
Stroke All Asia . Prevalence of 45-471 per 100,000 people in South Leading cause of death, long-term disability 
Asia’® Incidence of stroke differs geographically in China — 
. Annual stroke mortality in China is 1.6 million, there is a higher incidence in northern and western 
approximately 157 per 100,000, which has areas, which are associated with higher prevalence 
exceeded heart disease as the leading cause of of hypertension and obesity* 
death and adult disability! . Rural parts of South Asia have a lower stroke 
° Among a sample of five ASEAN countries prevalence compared with urban areas” 
(Indonesia, Myanmar, Vietnam, Thailand and . There are less than 100 stroke care units in South 
Malaysia), stroke was the top cause of death! Asia*’, leading to poor care for patients and 
. Stroke mortality in South Asia is the highest in the increased morbidity and mortality 
world, accounting for more than 40% of global ° Barriers to stroke thrombolysis in South Asia include 


stroke deaths*’ 
° Mortality in South Asia is 73 per 100,000 of the 
population’” 


a lack of infrastructure, lack of awareness and a lack 
of affordability®’, leading to increase in morbidity 
and mortality 


AllAsia . 44% of the world’s road deaths occur in Asia!° . India has the highest rate of TBI due to falls, and 
. The incidence rate of TBI in India is 160 per accounts for 50% of global falls'°® 
100,000; 1.6 million people will sustain a TBI’°® 
Tobacco use East and Southeast Asia . Prevalence of male smokers ranges from 36% in . Chronic nicotine consumption induces neuro- 
Singapore to 64% in Laos™; the rate is lower in adaptations in the brain’s reward system that result 
women*™* in nicotine dependence’” 
. Smoking in children aged between 13 and 15 is ° Withdrawal from nicotine can include somatic 
common in ASEAN® symptoms (for example, jumping, shaking, 
. Half of the world’s tobacco is consumed in Asia*® abdominal constrictions, chewing, scratching, and 
° South Asia has the highest use of smokeless facial tremors) or affective symptoms (for example 
tobacco worldwide*®® anhedonia)!”” 
° Past smokers are prone to relapse for weeks, months 
or even years after cessation’”” 
° Nicotine affects mood and cognition by stimulating 


nicotinic acetylcholine receptors on neurons in the 
brain’s mesolimbic reward system'”” 


ASEAN, Association of Southeast Asian Nations; NMDS, neurological, mental health, developmental and substance-use; TBI, traumatic brain injury. The 2014 population estimates for South Asia were 


1.692 billion and those for East Asia and Pacific region were 2.02 billion 


neuroscience is an interdisciplinary field and most LMICs do not have 
adequate training capacity. This, combined with the fact that the ex- 
pensive infrastructure needed for some areas of brain research is of- 
ten not available, drives many researchers from LMICs to migrate to 
high-income countries. 


REGION SPECIFIC RESEARCH NEEDS AND CHALLENGES 


‘There are specific needs across the regions that constitute LMICs, which 
have to be addressed in a region- and/or country specific manner. 


Identification of risk and protective factors 

There is an immediate need to characterize population groups that 
have increased susceptibility or resilience to brain disorders or bet- 
ter clinical outcomes, which could lead to the identification of dis- 
ease-modifying factors and interventions in other populations. 
Opportunities for research have been observed in different regions. 
For example, the course and outcome of schizophrenia is better un- 
derstood in India than in other countries”. The lifetime prevalence 
of PTSD as a major depressive disorder is not significantly greater in 
Southeast Asia compared with other parts of the world!, despite the 
region being a natural disasters-prone region. As a region with sig- 
nificant population growth trends, the likely increase in the number 
of people with childhood and adolescent disorders (including learn- 
ing disabilities) at one end of the spectrum and increasing lifespan 
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that leads to higher incidence of age-related neurodegenerative dis- 
orders (including dementia) at the other end make it imperative that 
resources are channelled to research aimed at identifying risk and 
protective factors*””. 


Integration of traditional methods of treatment 
Assessing the efficacy of indigenous, traditional Chinese medicine and 
Indian Ayurveda medicine for brain disorders is important. Integrating 
traditional Buddhist practices in the treatment of psychiatric disorders, 
such as the integration of mindfulness techniques into cognitive be- 
havioural therapy, has created new intervention approaches including 
mindfulness-based cognitive therapy”, mindfulness-based stress re- 
duction’, and dialectical behaviour therapy”. Similarly yoga, as an ad- 
dition to pharmacological interventions, is beneficial in the treatment 
of schizophrenia and depression’”””*. 


Collaborations and knowledge generation 

Opportunities have been made possible by improvements in infra- 
structure in sub-Saharan Africa, which sets the stage for cross-coun- 
try collaboration. For example, in addition to South Africa, several 
countries have neuroimaging facilities, which can be used to analyse 
brain structure and function to aid diagnosis and treatment. Malawi 
has excellent electroencephalography (EEG) services and the capac- 
ity to conduct longitudinal studies. Zambia has very good imaging 


19 November 2015 | 7578 | 527 


RAVINDRANATH ET AL. | REGIONAL RESEARCH 


Table 3 | Neurological, mental health, developmental and substance-use disorders and specific burden of disease in Latin America and the Caribbean 


on or disease 


Key affected countries 


Burden of disease 


Manifestation of conditi 


Unipolar depressive disorder 


Traumatic brain injury 


Dementia 


AlLLAC 


AILLAC 


AILLAC (higher in South America) 


AlLLAC 


Honduras, Panama, Chile, Peru and Colombia 


AIL LAC including large family groups in the 
Dominican Republic, Colombia, and Venezuela 


° One of the most common mental ° 
illness*, constituting 13.2% of the 
burden of total DALYs 

. Represents 35.7% among psychiatric 
disorders and is more prevalent among 
lower-income groups'” 


° One of the most common psychiatric ° 
disorders**, constituting 6.9% burden of 
total DALYS 

° Chile has the highest consumption rate ° 


of alcohol and tobacco” 


° Prevalence of smoking in men is 31% and 
in women is 17%"? 


° The region has one of the highest rates 
of injury mortality in the world?" 
. Has the highest incidence rates of 
traumatic brain injury caused by 
violence’”® 
. 17.8 (range, 6-43.2) per 1,000 people ° 


. Incidence is 77.7-190 per 100,000 people 
per year“, whereas in high-income 
countries it is 30-50 per 100,000 


. 6% of those over 60 years are affected” 
° Affects 2 million people, and likely to 
increase”” 


Typical manifestations are irritability, difficulty with 
concentration, fatigue or lack of energy, feelings of 
hopelessness and/or helplessness and sleep problems 


Alcohol consumption is a trigger of violence and 
accidents and is associated with 33% of intentional 
accidents and 26% of non-intentional accidents** 
Both associated with acute as well as long-term 
chronic conditions that range from brain damage, 
high blood pressure and stroke to liver and muscle 
diseases® 


Epilepsy is characterized by the appearance of 
primary generalized or partial seizures that begin 
with a widespread electrical discharge that involves 
one or both sides of the brain at once. Hereditary 
factors are important in many of these seizures 


DALYs, disability adjusted life years; LAC, Latin America and the Caribbean. The 2014 population estimates for Latin America were 521.9 million and for the Caribbean were 7.0 million 


Table 4 | Neurological, mental health, developmental and substance-use disorders and specific burden of disease in the Middle East and North Africa 


Burden of disease 


Unipolar depressive disorder 


Substance-use disorders 
(including nicotine, cannabis, 
alcohol and opiates) 


Epilepsy 


Key affected cou 


All countries 


Conflict zone countries 


All countries 


All countries 


All countries 


. Prevalence between 5-10%' 

. 15.3% Palestinian adults and children have 
depression” 

° The lifetime prevalence in Iraq is 7.2%” 

° 36% of adults in Iraq suffer from psychological 
trauma as a result of violence’ 

. A rate of 23.2% has been reported in Palestinian 
populations in the Gaza strip and Nablus district 
in the West Bank’® 

. Prevalence of 37.1% for Palestinian children’ 

. 80% of men and 67.8% of women in Yemen have 
used khat during their lifetime 

. There has been an increase in the prevalence 


of substance use in the Arabian Peninsula and 
East Africa, particularly among young adults and 
females 

. Tramadol use is a serious, growing public health 
problem in Egypt and other Middle East and 
North African countries (8.8% use among school 
children in Egypt)!" 

. Cannabis is the most commonly used drug 

. Ina 10 country study the tobacco smoking rate 
was 31.2%. The highest rates were in Jordan, 
Lebanon, Syria and Turkey” 


° Incidence is related to high consanguinity 
rates??? 

. Median prevalence is estimated to be 2.3 per 
1,000 (ref. 36) 

° In 23 Asian countries lifetime prevalence of 


epilepsy was 1.5 to 14 per 1,000 (ref. 54) 


° Women are more likely to have higher rates 
of depression than men 
e Disability, marital dysfunction, loss of 


employment and risk of suicide’® 


° Disability, loss of employment, disrupted 
family relationships and risk of substance 
misuse 

° Khat is implicated in depression, anxiety, 
psychosis and cognitive dysfunction®""* 

° Early first drug use leads to more drug 


problems later in life'*® 


0 Increased need for medical care 
0 Reduced lifespan 
° Increased family burden*** 


Prevalence is likely to be underreported because of 
stigma associated with the illness 


The 2014 population estimates Middle East and North Africa were 351.4 million. Population data from World Bank and aggregated by http://data.okfn.org/data/core/population 


and neurophysiology (EEG and nerve conduction velocity) facilities 
for adults and children, as well as the capacity for population-based 
studies in rural and urban centres and longitudinal cohort studies. In 
South Africa, a wide range of research techniques have been devel- 
oped, including EEG, electromyography, magnetic resonance imag- 
ing, diffusion tensor imaging, structural imaging, magnetic resonance 
spectroscopy, positron emission tomography and transcranial mag- 


netic stimulation. 
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LMICs is a major hindrance to 
tionate designation of health 


designate research budgets”. 


Health budgets and research funding 
A lack of adequate funding opportunities for neuroscience research in 


moving the field forward. The dispropor- 
spending in relation to variable national 


gross domestic product in LMICs makes it difficult to sustain or even 


For example, the order of the top three 


countries in sub-Saharan Africa — South Africa, Nigeria and Kenya — 


in terms of health research publications has remained unchanged for 
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Figure 2 | Number of neuroscience papers in international peer-reviewed jour- 
nals published by authors from Asian counties per year. The data were retrieved 
from http://www.scimagojr.com. 


the past 14 years, because of financial constraints imposed by total ex- 
penditure on health and the national gross domestic product®. Funding 
for NMDS disorders research is variable and depends on the priorities of 
the government agencies that fund health and/or science and technol- 
ogy research in general (where these exist). Three steps could be taken 
to promote neuroscience research in LMICs. First, governmental fund- 
ing for research through universities and research institutions should 
be enhanced and encouraged. Second, funds from national and inter- 
national non-governmental organizations (NGOs; which contribute up 
to 20% of all external aid for health services in developing countries, 
http://www.imva.org/Pages/biblfrm.htm) could be used to increase 
research opportunities in health and medicine, including epidemiolo- 
gy, clinical research, public health services and policy research. Third, 
increased collaboration with regional or international partners could 
lead to more research opportunities and support. 


CONCLUSIONS 


Regional variations in the challenges posed by NMDS disorders among 
LMICs means that research priorities need to be addressed coun- 
try-by-country, and by regions within countries. There are significant 
gaps between the resources needed for research and those that are cur- 
rently available, and a pressing need to strengthen human-resource 
capacity and research infrastructure, while promoting collaboration. 
Global demographic trends point to LMICs as the main work force of 
the future”; it is, therefore, imperative to act expeditiously to reduce 
the enormous burden of brain disorders in these countries. The loss of 
human potential and cost of inaction are unacceptably high. 
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Building global capacity for brain and nervous 
system disorders research 
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The global burden of neurological, neuropsychiatric, substance-use and neurodevelopmental disorders in low- and middle-in- 
come countries is worsened, not only by the lack of targeted research funding, but also by the lack of relevant in-country research 
capacity. Such capacity, from the individual to the national level, is necessary to address the problems within a local context. As 
for many health issues in these countries, the ability to address this burden requires development of research infrastructure and a 
trained cadre of clinicians and scientists who can ask the right questions, and conduct, manage, apply and disseminate research 
for practice and policy. This Review describes some of the evolving issues, knowledge and programmes focused on building re- 
search capacity in low- and middle-income countries in general and for brain and nervous system disorders in particular. 


Nature 527, S207-S213 (19 Month 2015), DOI: 10.1038/naturel16037 


This article has not been written or reviewed by Nature editors. Nature accepts no responsibility for the accuracy of the information provided. 


espite the current global burden of neurological, mental health 

(neuropsychiatric), developmental and substance use (NMDS) 

disorders, which is projected to increase, there is a lack of 
well-trained clinicians and scientists who focus on brain and nerv- 
ous system disorders research in low- and middle-income countries 
(LMICs)*¢ (see page S151). This workforce deficiency limits advances 
in research that can lead to new and improved interventions for those 
who are living with brain and nervous-system disorders. Notably, 
there are 200 times more neurologists per capita, and up to 160 times 
more psychiatrists per capita, in high-income countries than there 
are in LMICs”®. Brain disorders involve central nervous system (CNS) 
functioning, making things even more challenging. Key symptoms 
may involve both internalizing and externalizing behaviour. Inter- 
nalizing behaviour may be stigmatized and externalizing behaviour 
may be negatively stereotyped; both may be difficult to treat and as- 
sociated with poor prognosis. The stigma associated with NMDS dis- 
orders applies to both the patients and the clinician researchers who 
treat them, creating an additional barrier to building and sustaining 
research capacity. 

Thus, for health research in LMICs there is an urgent need to build 
new, and strengthen existing, individual, institutional and coun- 
try-wide research capabilities. This Review identifies certain key char- 
acteristics of capacity building, and the challenges and lessons learned 
based on the literature®!° and our own experience. 


BUILDING AND STRENGTHENING RESEARCH CAPACITY 
Research capacity building is a systematic, purposeful and goal-orient- 
ed effort to strengthen human resources and infrastructure to enable 
local scientists and institutions to become independent and responsive 
to existing and emerging health needs and threats’ ’. To be sustain- 
able and effective (and to address research-training sustainability 


concerns) a framework must be created so that research capacity is 
strengthened and woven together at the individual, institutional and 
national levels. To create research opportunity and a career pipeline, 
there needs to be a simultaneous focus on frameworks, goals and op- 
portunities. Embedding research into health systems requires a pro- 
cess that involves competent scientists and a supportive environment 
that enables research communities to flourish as they use new research 
tools that contribute to improving the health of the population”. This 
iterative process allows research to be responsive to population needs, 
and policies and practices to be responsive to research findings. 


INDIVIDUAL LEVEL 

At the core of capacity building is the training and mentoring of in- 
dividuals to design and conduct research; to create or adapt research 
tools that are relevant to brain-disorders research; to form collabora- 
tions with scientists in their institution, elsewhere in the country, and 
internationally; and eventually to serve as mentors themselves for the 
next generation of scientists (most effectively if they are within their 
home countries or region). Increasingly, researchers also need training 
on how to interact effectively with policy and programme implement- 
ers to ensure that they and their practices are adapted to the practices 
and policies locally. 

Capacity building at the individual level begins with mentors who 
counsel, provide career guidance, and advise on the teaching and 
sharing of ethical principles that instil integrity in research and care. 
Research-mentoring strategies and systems are context-driven, and 
often use apprenticeship or hands-on models, whereby individuals 
learn by doing. When little in-country research expertise and capacity 
exists, training may need to take place initially in a higher resource 
country. However, the goal should always be to have the research 
training in the context of the trainees’ home institution. 
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BOX 1 | BIOMEDICAL AND BIOBEHAVIOURAL 
RESEARCH ADMINISTRATOR DEVELOPMENT 
PROGRAMME 


The National Institutes of Health (NIH)/National Institute of 

Child Health and Human Development (NICHD) in collaboration 

with other NIH Institutions and Centers has made available the 
Biomedical/Biobehavioral Research Administrator Development 
(BRAD; http://www.nichd.nih.gov/about/org/od/ohe/brad/Pages/ 
overview.aspx) programme to establish new or strengthen existing 
offices of sponsored programmes (OSPs) or similar entities in low- and 
middle-income countries (LMICs). Enhanced OSPs in non-research 
intensive institutions are essential for the development of enabling and 
supportive environments in which faculty can develop robust research 
programmes and provide research experiences for students. 


BRAD objectives 

From a programme perspective, comprehensive and effective 
research administration is the bridge between research projects 
and a sustainable research enterprise at LMIC institutions. 
Accordingly, the BRAD programme objectives are to: 


e Encourage and support continuous professional development of 
OSPs, research administrators and grants managers at all levels. 

e Increase the effectiveness and productivity of OSPs (or similar 
entities) by promoting the use of best practices in research 
administration. 

e Promote OSP sustainability by identifying and addressing 
barriers to research and by supporting targeted faculty 
professional development that focuses on increasing 
competitiveness in obtaining external research support. 


In terms of systems, scientists initially educated in LMICs then 
trained in high-income countries say that they are accustomed to cer- 
tain ways of providing and receiving feedback, and interacting within 
hierarchical relationships. When they return to their home academ- 
ic environments, ideally they will adopt a hybrid mentoring style, 
combining positive attributes from both the country where they were 
trained and their home country, of which they have an awareness of 
local nuances, regarding local customs, politics and bureaucracy. 

Questioning and vigorous debate are integral to the scientific pro- 
cess, but young investigators from cultures that emphasize deference 
in hierarchical relationships may experience a conflict between these 
two values when they return home. 

In terms of research and clinical practice, performing a lumbar 
puncture to obtain cerebrospinal fluid (CSF), a requirement for the 
diagnosis of most CNS infections and some degenerative or develop- 
mental abnormalities, brain banking and autopsy can all be met with 
reluctance in LMICs. Extraction of fluids, tissue or organs for donation 
or banking for research purposes is associated with significant soci- 
ocultural barriers in some cultures. Several factors related to religion 
and culture, and issues related to distrust of the medical system, mis- 
understandings about religious stances and ignorance often compli- 
cate the process. Requests for brain or other organ banking for re- 
search purposes could raise concerns that agreeing to donation would 
discourage doctors from treatment to save lives among relatives or that 
consenting to banking would result in premature removal of their or 
their relative’s organs. 

To build capacity for these endeavours, mentors must be willing to 
not just advocate for these techniques, but also to address cultural bar- 
riers that may affect policies, as well as the discomfort of trainees who 
may not have experience of approaching relatives and patients about 
these procedures. 

The National Institutes of Health/Fogarty International Center 
Global Brain, NIDA International Fellowships and other programmes 
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(see Supplementary Table) are designed to help catalyse research ca- 
pacity development at an individual level (both as research training 
opportunities for young investigators, and as research pipeline oppor- 
tunities for more advanced LMIC investigators) to help prevent the loss 
of crucial talent and expertise. 


INSTITUTIONAL LEVEL 
Institutional capacity building is the administrative foundation and is 
essential for establishing and sustaining initiatives intended to realize 
its vision. Research infrastructure includes job positions that provide 
protected time for research, as well as robust laboratories and clini- 
cal spaces where diagnosis, treatment and research can be conduct- 
ed. Research into brain disorders, especially stroke, CNS infections, 
trauma and neurodegenerative conditions requires the technology to 
assess structural neurological abnormalities; for example, comput- 
erized tomography or magnetic resonance imaging may be non-ex- 
istent or prohibitively expensive in many LMIC settings. Although 
research-training grants typically provide the funding necessary to 
train new scientists and the equipment to increase laboratory capacity, 
larger infrastructure capacity-building endeavours, such as acquiring 
high-cost diagnostic neuroimaging or laboratory equipment, or con- 
structing new laboratories, clinics or classrooms, require the financial 
commitment of institutions with support from funders (ideally, and 
eventually, at the country level for maximum sustainability). 
Research and grants administration are crucial to the sustain- 
ability of research programmes within any institution, but good ad- 
ministrators can enhance the development of research capacity in re- 
source-challenged institutions (Box 1). 


Networks 

Research and research-training networks enrich the research environ- 
ment and build capacity by increasing collaborations and partnerships; 
expanding institutional perspectives from local and national levels to 
regional and perhaps the global level; and facilitating ideas exchange, 
dialogue and universal or standardized protocols for brain research’. 
Such networks are most effective when they attract not only individual 
scientists and academic institutions, but also non-governmental or- 
ganizations (NGOs), corporations, policymakers, and/or philanthro- 
pists, to sustain and embed the research enterprise within a country 
that is focused on a health issue, such as NMDS disorders. One example 
is the neuroscience promotion association APRONES. This association 
was established by a group of neurologists to share knowledge of dis- 
eases of the nervous system in LMICs. Members are from Africa, Eu- 
rope and the United States. The association encourages collaborative 
studies around the world, while building networks and ultimately re- 
search capacity”. 


NATIONAL LEVEL 
True sustainability of research capacity and its application requires a 
national commitment to the research enterprise and implementation 
of research results at the policy level, as well as a continuing dialogue 
between health practitioners, policymakers and researchers. However, 
often it is not until research and research-training networks are estab- 
lished that local government and NGOs recognize the benefit of talent 
and training to system-wide improvements and national human-re- 
source development”. At that point, they take actions to sustain it. 

According to the WHO’, work in support of the ethical review and 
public accountability of research is not keeping pace with best practic- 
es. Opportunities to create a shared framework for storing and sharing 
research data, tools and materials, have not been met with the same 
energy in the area of health as they have in other scientific fields. Fur- 
thermore, policymakers rarely understand research priorities or use 
evidence to inform their decisions. 

Without country-level planning and action, along with guidance 
documents, health research in LMICs may be influenced more by the 
demands of foreign funders’ research and infrastructure interests than 
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Table 1 | Approaches to building research capacity 


Activity General and specific needs 


Anticipated benefits 
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Specific approaches 


Increase the number of ° 
clinician researchers 


Increased number of physicians and allied 
health professionals in research benefits 

all neuroscience researchers, including 
neurologists, neurosurgeons, infectious disease 
specialists, psychiatrists and other mental health 
practitioners 

° As the incidence of neurological conditions 
increases, so will the need for more trained 
neurologists involved in research 


Human capacity ° 


° Increase research capacity ° As the neurosciences have not received as much 
of clinicians and researchers attention from research training grants, most 
through workshops and areas would benefit from increased funding for 
short courses; and advanced research training 
degrees in public health ° Health-systems research is needed, which 


necessitates training in bioethics, research 
biostatistics, clinical methodology, epidemiology, clinical trials, 

trials, health services and population-based methodology and intervention 
implementation science), studies 

clinical and basic science ° Specific areas of neuroscience with unique 
research needs, include mental health for which health- 


(for example, epidemiology, 


e Sub-specialized training services research is crucial to increase the 
on specific skills related to capacity of care services 
nervous-system disorders. 9 Many countries do not have the ability to 
diagnose neurogenetic conditions, and cannot 
provide genetic counselling or treatment 
. Institutionalization of ° Outstanding mentoring is a prerequisite of any 


successful research-training programme 

° Mentors must be expert in particular areas of 
research, such as cognitive assessment scales for 
the study of dementia-associated conditions, or 
seizure management for studies of epilepsy 


mentorship training 


Most neurological conditions require 
neuroimaging to confirm a diagnosis, disease 
stage or to monitor progress 


. Neuroimaging (for example, ° 
computerized tomography or 
magnetic resonance imaging) 


GWAS are used to identify genetic variations 
(SNPs) associated with neurological and 
psychological disorders, including addiction 


. Genomic sequencing to ° 
detect SNPs in GWAS 


Most studies of neurological diseases require 

at least a basic laboratory to process blood, 
cerebrospinal fluid or other human samples 

. With increasing complexity of studies, additional 
equipment is needed, such as polymerase chain 
reaction for detecting infectious pathogens or 
biosensors to detect environmental toxins 


e Increased laboratory capacity | ¢ 


. Culturally appropriate ° WHO and NIH databank of valid and reliable 
assessment and screening assessments for young people and adults. Each 
tools has an armamentarium of tools that are culturally 

appropriate 
° Stigma, social and health disparities are more 


common with disorders such as epilepsy and 
schizophrenia 


Practical experience and 
opportunities for future 
training’* 


Address the shared burdens of 
common conditions, including 
neurodegenerative disorders, 
stroke and epilepsy 
Development of 
multidisciplinary teams of 
health-care professionals 

to improve prevention, pre- 
hospital care, and clinical 
care in neuroscience, such 

as trauma, mental health or 
neurogenetics 


Capacity for conducting 
neuroscience research will 
increase as trainees move 

into positions where they will 
start mentoring subsequent 
generations of trainees 
Increasing numbers 

of scientists and the 
development of research 
teams, research culture and an 
increase in scientific literature 
and novel research 


Increased availability 

of neuroimaging will 

lead to better definition 

of the burden of many 
neurological conditions, such 
as stroke, CNS infections, 
developmental, degenerative 
and genetic disorders, and 
trauma 


Detection of specific genes 
through GWAS can lead to a 
better understanding of the 
functional mechanisms that 
are biologically important 

in disease pathogenesis 
and, ultimately, to better 
treatments for neurological 
diseases 


Many technologies introduced 
to increase laboratory 
capacity are also useful for 
diagnosing non-neurological 
diseases 


Disorders such as epilepsy and 
schizophrenia would benefit 
from increased recognition 

of barriers identified through 
culturally appropriate 
screening tools 


Create protected time and 
funding for research 
Decrease brain drain by 
investing in research and jobs 
in neurological areas 
Research methodology 
training during graduate 

and post-graduate medical 
training 


Multidisciplinary training and 
research 

The Wellcome trust-DBT 
india alliance fellowship 

for clinicians and research 
scientists 

Innovation in science 

pursuit for inspired research 
programme 

initiative in neuroclinical 
research education 

The African Brain Mapping 
and Therapeutics 

initiative, which advances 
neuroscience research in 
Africa by promoting global 
partnerships for brain-disease 
prevention and treatment 


Wide spread mentorship 
training (for example, through 
programmes such as those of 
the NIH Fogarty International 
Center http://www.fic.nih. 
gov/) 


Mentoring in the tools needed 
through flexible research 

and research-training 
programmes (for example 
through the Fogarty Global 
Brain programme) 


The US National Center for 
Biotechnology Information 
has developed the Database 
of Genotype and Phenotype, 
where genetic sequencing 
information can be deposited 
and accessed 


Enhancement of laboratory 
capacity often requires the 
upgrade of electrical systems, 
and with larger laboratories 
may also require installation 
of air-conditioning systems 


NIH and WHO promote 
scientific discovery, and 
shared resources, that allow 
for data harmonization across 
many programmes 

Network meetings with 
special interest groups value 
the use of unified concepts 
of addiction and mental 
health, from DSM to ICD 
classifications cont. 


by the health priorities of the host country”. In LMICs in general, and focusing research on key health challenges by setting priorities; 
Africa specifically, increasing the value of health research requiresev- strengthening national health research systems and building capac- 
idence-informed actions to be taken by relevant authorities to ensure ity; encouraging good research practice (setting standards) and con- 
that health research is conspicuous in development agendas. It also _ solidating links between health research and action (translation and 
requires defining, financing and monitoring a clear national plan fora | evidence-based implementation). 

future research enterprise focused on health. To achieve these goals, Needs and opportunities for building and strengthening capacity 
policymakers and public health and research-funding institutionscan for brain-disorders research are shown in Table 1. Although not ex- 
use principles adapted from the WHO Strategy on Research for Health’ haustive, they outline specific valuable approaches that can be used by 
as a guide. These provide the overall framework for research capac- high-income country and LMIC collaborators. 

ity and include reinforcing the research culture and organization; The Supplementary Table includes some organizations that are 
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Category 


Activity 


General and specific needs 


Anticipated benefits 


Miniaturization of diagnostic 
technologies for genomics, 
infectious agents and 
environmental markers will 
enable a better understanding 
of gene—-environment 
interactions and lead to new 
therapeutic approaches 


Improved access to electronic 
scientific literature should 
lead to more scientifically 
sound research and often 
leads to the creation of 
journal clubs, which in turn 
strengthens the culture of 
research 


Reaching patients with 
disabling neurological 
conditions using cell phones 
may prove easier than 
conventional methods for 
providing health information 


Specific approaches 


° Better sharing of data and 
instrumentation is needed, as 
well as collaborative grants 


° The HINARI Access 
to Research in Health 
programme provides free 
or low cost access to 200 
neuroscience journals for not- 
for-profits in LMICs, but the 
top ranking 50 journals are 
not available 

° Open access journals are 
available to everyone 


Share resources 
Offer classes for students at 
reduced cost 


Technology . Incorporation of emerging POC diagnostics could permit rapid diagnosis 
POC diagnostics from both of many neurological infections in the field, 
the development of cross- resulting in improved recognition and treatment 
cultural tools to the use of POC diagnostics can be used to non-invasively 
the tools monitor seizures, cerebral blood flow or 
intracranial pressure, but are not widely available 
for use in LMIC settings 
° Access to electronic scientific Common to all research is the need for 
literature understanding past and current scientific 
literature 
. Introduction of mHealth Adoption of mobile technologies for surveillance, 
technologies and e-learning assessments and treatment are particularly 
strategies needed in LMICs where cell phone ownership is 
rising rapidly, but access to conventional health 
care and health-care providers is limited 
Modular Internet-based curricula can be adapted 
for training for advancing neuroscience research 
Low-tech clinical simulation training emphasizing 
early life-saving interventions and procedures 
° Increase Internet capacity . Adapt information and communication ° 


training programmes 


° Pilot awards for LMIC . 
researchers 
more so 


° Increase governmental . 
funding for research through 
universities and research 
institutions 


e Research frameworks to . 
support the implementation 
of the outcomes of well- 
designed studies relevant to 
neurological diseases 


into the health-care system 


. Capacity building in ° 
translational science and 
knowledge management 

° Learn to package the evidence 
in a format more accessible to 
policymakers in LMICs 


training 


technologies to support research and research- 


Funding for research in LMICs is limited, but ° 
funding for neurological disease research is even 


Funds from national and international NGOs can . 


increase research opportunities 


Evidence-based public health strategies to ° 
incorporate child neurodisability screening, 
clinical evaluation and rehabilitation packages 


LMIC partners are asking for translational science | ¢ 


Video conferencing for direct | ¢ 
communication between 
mentors, colleagues and 
training in diverse settings 


Communication technologies 
include Skype, GoToMeeting, 
AdobeConnect, WhatsApp, 
Polycom and WebEx 


Providing funding to support e 
pilot studies to LMIC 
colleagues and trainees 
should lead to increased 
research relevant to the 
LMIC setting and provide 
pilot data on which larger 
grant applications could be 
developed 


Mentor LMIC partners 
through application 
processes. 


PEPFAR, UNAIDS, WHO, and 
the Bill and Melinda Gates 
Foundation have made drugs 
and services significantly 
more accessible 


Collaboration with foreign ° 
partners provides new 
research opportunities and 
support 


Maternal health programmes 
that work closely with early 
childhood programmes could 
ensure optimal pregnancy 
outcomes and develop 
effective interventions to 
enhance child development 


Involvement of the 
community, private sector 
and research sponsors during 
project planning establishes 
priorities, identifes research 
needs within the community, 
and identifies resources* 


Influence policymakers to ° 
redirect budget priorities to 
address brain disorders and 
their research 

. Involve community 
partnerships 


CNS, central nervous system; DSM, Diagnostic and Statistical Manual for Mental Disorder; GWAS, genome-wide association studies; ICD, International Classification of Diseases; LMIC, low- and middle- 
income countries; NGOs, non-governmental organizations; POC, point-of-care; SNP, single nucleotide polymorphisms; WHO, World Health Organization. 


investing in research-capacity building for brain disorders. Some 
long-term examples of programmes specifically focused on building 
a pipeline that stretches from individual to institutional to nation- 
al research capacity levels for nervous system diseases and disorders 
in LMICs are shown in Box 2. These programmes include the US NIH/ 
Fogarty coordinated Global Brain and Nervous System Diseases and 
Disorders Across the Lifespan Research Program and several Fogarty 
centre sponsored institutional research training programmes (focusing 
on masters, PhD and postdoctoral level training for LMIC investigators; 
Supplementary Table). 


ADDRESSING CHALLENGES TO CAPACITY BUILDING 

The framework to build the individual, institutional and national ca- 
pacity described requires principles of engagement, much like those 
for community engagement’. Those most relevant are the seven prin- 
ciples from the ESSENCE good practice document series!’ (Box 3). 
These core principles serve as a useful guide for funding agencies, the 
scientific community and academic institutions on how to move for- 
ward as they identify priorities, develop goals and objectives, design 
programmes and establish partnerships. They are also useful principles 
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to address various challenges in collaborations within and between 
countries and cultures, such as human, infrastructure, technological 
and ethical challenges. 


Human capacity challenges 
Capacity building across countries and cultures for brain-disorders re- 
search is inherently a ‘messy’ processes when we consider the scope of 
research with global partners across completely different time zones, 
infrastructures, cultural norms, expectations and organizational re- 
search capacities. Differences in language and expression of research 
ideas can lead to confusion and misunderstanding. When choosing 
terminology for assessments on depression, for example, well-de- 
veloped Western assessments use the words ‘feeling blue’ to indicate 
feelings of sadness. To discuss mania, the term ‘high’ might be used. 
These terms are idiomatic and do not translate well in many languages. 
Another example is the need for translators and interpreters in differ- 
ent sites within countries where multiple languages and dialects are 
spoken. 

Research training includes emphasizing flexibility to address such 
cultural challenges to research and working within local and regional 
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BOX 2 | ANATOMY OF A GLOBAL BRAIN RESEARCH 
FUNDING PROGRAMME 


Achievements of the National Institutes of Health/Fogarty 
International Centre coordinated global brain research programme 
(http://www.fic.nih.gov/About/Staff/Policy-Planning-Evaluation/ 
Pages/fogarty-program-evaluation-brain-disorders.aspx). 


The programme supports collaborative empirical research and 
capacity building on brain and nervous-system diseases and 

disorders identified by the applicants as relevant public health 
challenges in their low- and middle-income countries (LMICs). 


Programme’s achievements 
Research conducted over 10 years in 45 LMICs, most of which are in 
sub-Saharan Africa, Latin America and the Caribbean. 


Topics were across the spectrum, from mental health and substance 
use, to peripheral nervous system trauma and gene—-environment 
interactions. 


During the first 10 years of the programme, participants published 
435 peer-reviewed articles in 249 unique journals, as well as 14 
books or chapters. 


Grantees also produced unique tools for clinical assessment in the 
LMIC context, developed and evaluated new interventions, and 
identified novel laboratory tools or methods. 


Almost half of the projects supported training for people, who were 
not primary collaborators, in LMICs. The programme supported 
in-depth instruction for at least 138 scientists, for an average of 23 
months. 


Projects included training or mentoring at the LMIC (or sometimes 
a high-income country) site, in skills, methods or procedures that 
are essential to research, including workshops on specific topics, or 
clinical or research skills. 


Achieved mandatory training in research ethics, which built and 
sustained capacity in research ethics at most sites. 


norms (the ESSENCE principles shown in Box 3 can help). For example, 
when conducting a study on the prevalence of opiate use in Afghani- 
stan, two female interviewers were needed to conduct interviews with 
the female head of the household. Cultural norms dictated that the fe- 
male interviewers had to be accompanied by a male team member who 
would make the first contact with the household’. Designing research 
protocols that account for local cultural norms, while educating the 
high-income country and LMIC institutional review boards is neces- 
sary and builds trust and understanding between collaborators over 
time. Individual challenges can be resolved through open communica- 
tion and sharing of expectations at the outset. 


Infrastructure capacity challenges 

To conduct research, LMIC institutions require institutional review 
board committees, grant-management personnel, and data and doc- 
ument management capacities. These capacities vary widely across 
and within countries, but sufficient capacity in these areas is crucial 
to ensure fidelity to research protocols. Financial resource limitations 
and limited access to scientific and technical information are also key 
challenges. 

To overcome these barriers, high-income country and LMIC part- 
ners have strengthened institutional support by setting up meetings 
with presidents, deans and directors of institutes to advocate for more 
resources and to become less reliant on outside high-income country 
funding. Researchers have also set up channels of communication and 
collaboration whereby they can help each other in the grant application 
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BOX 3 | SEVEN PRINCIPLES FOR STRENGTHENING 
RESEARCH CAPACITY 


Based on the World Health Organization-TDR ESSENCE good 

practice document series”® 

1. Network, collaborate, communicate and share experiences 

2. Understand the local context and accurately evaluate existing 
research capacity 

3. Ensure local ownership and secure active support 

4. Build in monitoring, evaluation and learning from the start 

5. Establish robust research governance and support structure, and 
promote effective leadership 

6. Embed strong support, supervision and mentorship structures 

7. Think long-term, be flexible and plan for continuity 


process for Western-based grants. 

Investigators have also succeeded in seeking permanent access to 
library resources through high-income country institutions for their 
trainees. However, more sustainable access within and across LMICs to 
bridge the global information divide is needed. One source is HINARI”®. 


Technology capacity challenges 

Information and communication technology (ICT) has become in- 
creasingly integrated into research and clinical training. ICT involves 
a variety of technologies, including low-cost two-way voice, picture 
and video communication; development of geographic information 
systems, which are useful for planning interventions and mapping the 
prevalence of neurological conditions and risk factors*”*; Internet- 
and mobile-phone-based health-related interventions*, and Inter- 
net- and mobile-phone-based data collection*”. 

Online courses and degree programmes that have become in- 
corporated into most high-income country academic institutions 
have particular utility in LMICs where training infrastructure may be 
lacking or geographical barriers limit participation in conventionally 
structured research training programmes. The development of mas- 
sive open online courses (MOOCs) and bidirectional interactive vir- 
tual spaces permit multidisciplinary partnerships between students, 
faculty and mentors across institutions and countries. These provide 
new practical opportunities for bidirectional training, presentations 
and classroom-based discussions around the world (either as live or 
recorded sessions). 

The Internet allows research, clinical training and supervision to 
take place across the globe, and although the content and quality of 
the online material is important, the effectiveness of the supervision 
depends on the quality of the input, and learning ultimately rests on 
the ability and motivation of the trainee. Internet interventions have 
the potential to reduce manpower requirements, but without sufficient 
support, completion rates remain unacceptably low. There is a need to 
rigorously evaluate the use of these technologies in brain-disorders re- 
search training to ensure they are effective, acceptable and culturally 
relevant. 


Ethical challenges 

The field of neuroethics is a component of bioethics that deals with 
the investigation, treatment and research procedures that involve the 
human brain and brain science. The International Neuroethics Society 
(http://www.neuroethicssociety.org) was started to promote research 
that would benefit people around the world. 

Although all research training should include human subject re- 
search ethics, teams that focus on brain-disorders research face unique 
ethical challenges. People with neurological disorders are vulnerable, 
sometimes cognitively or physically challenged and often stigmatized, 
which creates special challenges when designing protocols that en- 
sure ethical informed consent. It is essential to address these special 
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BOX 4 | BIOETHICS TRAINING RESOURCES 


Resources for international bioethics research training curricula 
can be found through the Fogarty International Centre (Fogarty) 
International Research Ethics Education and Curriculum 
Development Award (or bioethics) programme (http://www.fic.nih. 
gov/ResearchTopics/Pages/Bioethics.aspx). 


Scientist Nandini Kumar was in the first class of trainees funded 

by the Fogarty programme at the University of Toronto (http:// 
www.fic.nih.gov/News/Examples/Pages/bioethics-india.aspx). 
After completing the course in 2002, Kumar was successful in 

her bid for a planning grant, and received a full Fogarty award to 
implement her programme in 2005. Over the first 7 years, Kumar's 
programme trained more than 2,000 scientists and health-care 
workers. More than 50 of them completed the intensive course 

and 2 earned master’s degrees. The programme has hosted nearly 
34 intensive workshops in 16 Indian cities. Its distance-learning 
programmes train up to 50 people each year. Many of the graduates 
have published papers, prepared curriculum for bioethics 
instruction at their own institutions, presented papers at national 
and international conferences, served as evaluators and set up or 
become members of ethics committees. As the field has developed, 
Kumar has become not only a national leader in bioethics, but also 
a member of international panels, including the US Presidential 
Commission for the Study of Bioethical Issues. 


challenges (some of which are similar to those faced by researchers 
working with populations affected by HIV/AIDS) in training curric- 
ula for research that is specific to brain disorders. The Fogarty funded 
Pakistani stroke research training programme has a dedicated neuro- 
ethics training module, in which every mentored project that involves 
mental health research has a bioethics programme (Box 4). 

Use of functional magnetic resonance imaging (fMRI), near-infra- 
red recording systems (NIRS), polygraphy to extract information, ge- 
netic testing and cognitive enhancement using drugs and brain stim- 
ulation are just a few examples of current and evolving technologies 
that raise moral and ethical questions. The application of these mod- 
ern techniques has been gaining momentum in the developing world, 
thereby indicating an urgent need to integrate neuroethics training 
into the neuroresearch capacity building efforts in LMICs. 


Metrics 

A robust set of metrics is crucial to demonstrate the value of research 
capacity building to diverse stakeholders and to understand how to 
make research-training activities the most effective. Output measures 
include educational materials such as courses, modules and work- 
shops; creation and transfer of new knowledge, such as prototypes 
and innovative protocols; and measuring trainee engagement indica- 
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Figure 1 | Research-capacity building activities can be achieved in a number of 
ways. Highlighted are examples of how three of activities can be achieved. EPI, 
epidemiology; GIS, geographical information systems; MOOCs; massive open 
online courses; RCR, responsible conduct of research. 
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tors (for example, number of short-, medium- and long-term train- 
ees taught, number of trainees completing courses or acquiring new 
skills and trainee feedback). As research training programmes mature, 
conventional metrics are needed, such as publications, grants, awards, 
memberships in societies, degrees awarded and faculty appointments. 

Measuring the long-term impact of building research capacity is 
a significant challenge. Many funding agencies have strict guidelines 
for tracking career successes of funded scholars for up to 15 years 
after training and evaluation frameworks that can be built into pro- 
grammes from the beginning to ensure trackable impacts (http:// 
www.fic.nih.gov/About/Staff/Policy-Planning-Evaluation/Pages/ 
evaluation-framework.aspx). Long-term impacts of successful capac- 
ity building include cultural competency of staff and faculty; increased 
involvement of staff and faculty in global health brain research; the ex- 
tent to which former trainees hold positions of influence in their coun- 
tries; leadership of former trainees in research and research collabo- 
rations; and increased knowledge of disorders and their significance 
locally and internationally. 


CONCLUSIONS 

Figure 1 summarizes some of the frameworks, components, path- 
ways and tools involved in research-capacity building. As described, 
research-capacity building starts at the individual level. Although 
partnerships between high-income and LMICs are important, the 
goal is for research training, as well as research itself, to increasingly 
take place at the LMIC sites and for those sites to become research and 
training hubs in their own right. 

Concurrent research capacity strengthening at the institutional 
and national levels is necessary to ensure research and career oppor- 
tunities for, and the retention of, trained researchers. With respect to 
NMDS disorders in particular, targeted programmes provide opportu- 
nities for LMIC clinicians, faculty and trainees to gain new skills for 
conducting relevant research and to contribute to long-term sustain- 
ability of research conducted in LMICs (as the trainees become the 
trainers and attention is paid to institutional strengths and weakness- 
es). Although challenges exist, they can be managed and eventually 
reduced or overcome using principles and models learned and shared 
across programmes®?. Robust evaluations of capacity-building activ- 
ities with quantitative and qualitative measures should be conducted, 
shared and used to identify the most successful approaches and to al- 
low iterative improvements in individual, institutional and national 
level NMDS research capacity. 
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